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MOLECULAR SCAFFOLDS FOR KINASE LIGAND DEVELOPMENT 

BACKGROUND OF THE INVENTION 

[0001] This invention relates to the field of development of ligands for protein kinases. 

[0002] Examples of protein kuaases are the PIM kinases, including PIM-l, PIM-2, and 
PIM-3. The PEM-l proto-oncogene was originally identified as a genetic locus frequently 
activated by the proviral insertion of Moloney murine leukemia virus into mouse T cell 
lymphomas (Cuypers, H. T., Selten, G., Quint, W., Zijlstra, M., Maandag, E. R., Boelens, 
W., van Wezenbeek, P., Melief, C, and Bems, A. (1984) Murine leukemia virus-induced T- 
cell lymphomagenesis: integration of proviruses in a distinct chromosomal region. Cell 
37:141-150), The PIM~1 proto-oncogene has also been implicated in human hematopoietic 
malignancies with its overexpression frequently detected in human hematopoietic cell lines 
as well as in fresh tumor cells from patients with leukemia (Nagarajan L, Louie E, 
Tsujimoto Y, ar-Rushdi A, Huebner K, and Croce CM. (1986) Localization of the human 
PIM oncogene (PIM) to a region of cliromosome 6 involved in translocations in acute 
leukemias. Proc. Natl Acad, Set USA 83:2556-2560; Meeker TC, Nagarajan L, ar-Rushdi 
A, Rovera G, Huebner K, and Croce CM. (1987) Characterization of the human PIM-1 
gene: a putative proto-oncogene coding for a tissue specific member of the protein kinase 
family. Oncogene Res. 1 : 87-101 ; Amson R, Sigaux F, Przedborski S, Flandrin G, Givol D, 
and Telerman A.(1989). The human proto-oncogene product p33PIM is expressed during 
fetal hematopoiesis and in diverse leukemias. Proc, Natl Acad. ScL USA 86: 8857-8861). 

[0003] The PIM family of proto-oncogenes in human and mouse now consists of at least 
three members, that code for highly related serine/threonine specific protein kinases (Saris 
CJ, Domen J, and Bems A. (1991) The PIM-1 oncogene encodes two related protein- 
serine/threonine kinases by alternative initiation at AUG and CUG. EMBO J, 10: 655-664; 
Eichmann A, Yuan L, Breant C, Alitalo K, and Koskinen PJ. (2000) Developmental 
expression of PIM kinases suggests fianctions also outside of the hematopoietic system. 
Oncogene 19: 1215-1224). The ftmction of these three kinases (PIM-1, PIM-2 and PIM-3) 
appear to complement each other in mice, as deletion of one of the PIM family protein 
genes did not result in any severe defects (Laird PW, van der Lugt NM, Clarke A, Domen J, 
Linders K, McWhir J, Bems A, Hooper M. (1993) In vivo analysis of PIM-1 deficiency. 
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Nucl Acids Res. 21:4750-4755), During embryonal development PIM genes are expressed 
in partially overlapping fashion in cells in both immime and central nervous system as well 
as in epithelia (Eichmann A, Yuan L, Breant C, Alitalo K, and Koskinen PJ. (2000) 
Developmental expression of PIM kinases suggests functions also outside of the 
hematopoietic system. Oncogene 19: 1215-1224). PIM-1, the prototypical member of the 
PIM family is located both in the cytoplasm and nucleus, but its precise role in these two 
locations has not been fully elucidated. 

[0004] Transgenic mice' with PIM~1 driven by Emu enhancer sequences demonstrated that 
PIM-1 function as a weak oncogene because by itself it does not lead to tumor formation 
but does so after a second oncogenic gene become overexpressed. In 75% of the tumors 
over-expressing PIM-1, the second gene found to be over-expressed is c-myc (van der 
Houven van Oordt CW, Schouten TG, van Krieken JH, van Dierendonck JH, van der Eb 
AJ, Breuer ML.(1998) X-ray-induced lymph'oinagenesis in E mu-PIM-1 transgenic mice: an 
investigation of the co-operating molecular events. Carcinogenesis 19:847-853). In fact 
when crosses were made between Emu-PIM transgenic mice and Emu-myc transgenic mice, 
the combination of genes is so oncogenic that the offsprings die in utero due to pre B cell 
lymphomas (Verbeek S, van Lohuizen M, van der Valk M, Domen J, Kraal G, and Bems A. 
(1991) Mice bearing the Emu-myc and Emu-PIM- 1 transgenes develop pre-B-cell leukemia 
prenatally. MoL Cell Biol, 11: 1176-1179). 

[0005] Mice deficient for PIM-1 show normal synaptic transmission and short-term 
plasticity but failed to consolidate enduring LTP even though PIM-2 and PIM-3 are 
expressed in the hippocampus (Konietzko U, Kauselmaim G, Scafidi J, Staubli U, Mikkers 
H, Bems A, Schweizer M, Waltereit R, and Kuhl D.(1999) PIM kinase expression is 
induced by LTP stimulation and required for the consolidation of enduring LTP. EMBOJ, 
18: 3359-3369). 

[0006] Various factors are known to enhance the transcription of PIM-1 kinase in mouse 
and human. PIM-1 closely cooperates with another oncoprotein, c-myc, in triggering 
intracellular signals leading to both transformation and apoptosis and the selective inhibition 
of apoptotic signaling pathways leading to Bcl-2 (van Lohuizen M, Verbeek S, Krimpenfort 
P, Domen J, Saris C, Radaszkiewicz T, and Bems A. (1989) Predisposition to 
lymphomagenesis in PIM-1 transgenic mice: cooperation with c-myc and N-myc in murine 
leukemia virus-induced tumors. Cell 56:673-682; Breuer ML, Cuypers HT, Bems A. 
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(1989). Evidence for the involvement of PIlVI-2, a new common proviral insertion site, in 
progression of lymphomas. EMBOJ, 8:743-748.; Verbeek S, van Lohuizen M, van der Valk 
M, Domen J, Kraal G, and Bems A. (1991) Mice bearing the E mu-myc and E mu-PIM-1 
transgenes develop pre-B-cell leukemia prenatally. MoL Cell. BioL 11 : 1 176-1 179; 
Shirogane T, Fukada T, MuUer JM, Shima DT, Hibi M, and Hirano T. (1999) Synergistic 
roles for PIM-1 and c-Myc in STAT3-mediated cell cycle progression and antiapoptosis. 
Immunity, 1 1 : 709-71 9), PIM-1 kinase is induced by T cell antigen receptor cross linking 
by cytokines and growth factors and by mitogens including IL2, IL3, IL6, IL9, IL12, IL15, 
GM-CSF, G-CSF, IFNa, DSTFg, prolactin, ConA, PMA and anti-CD3 antibodies (Zhu N, 
Ramirez LM, Lee RL, Magnuson NS, Bishop GA, and Gold MR.(2002) CD40 signaling in 
B cells regulates the expression of the PIM-1 kinase via the NF-kappa B pathway, J 
Immunol 168: 744-754). PIM-1 expression is rapidly induced after cytokine stimulation 
and the proliferative response to cytokines is impaired in cells from PIM-1 deficient mice 
(Domen J, van der Lugt NM, Acton D, Laird PW, Linders K, Bems A.(1993) PIM-1 levels 
determine the size of early B lymphoid compartments in bone marrow. J. Exp. Med. 178: 
1665-1673). 

[0007] Recently, it has been reported that PIM family of kinases interact Avith Socs-1 
protein, a potent inhibitor of JAK activation thereby playing a major role in signaling down 
stream of cytokine receptors. The phosphorylation of Socs-1 by PIM family of kinases 
prolongs the half-life of Socs-1 protein, thus potentiating the inhibitory effect of Socs-1 on 
JAK-STAT activation (Chen XP, Losman JA, Cowan S, Donahue E, Fay S, Vuong BQ, 
Nawijn MC, Capece D, Cohan VL, Rothman P. (2002) PIM serine/threonine kinases 
regulate the stability of Socs-1 protein. Proc. Natl Acad. Set USA 99:2175-2180.). PIM-1 
is expressed during Gl/S phase of the cell cycle suggesting that it is involved in cell cycle 
regulation (Liang H, BSttehnan Nagarajan L., Ubiquitous expression and cell cycle 
regulation of the protein kinase PIM-1 . (1996) Arch Biochem Biophys, 330:259-265), ). 
PIM-1 kinase activity and the protein level is increased in CD 40 mediated B cell signaling 
and this increase in PIM-1 level is mediated through the activation of NF-kB (Zhu et al. 
2002- supra). PIM-1 can physically interact with NFATc transcription factors enhancing 
NFATc dependant transactivation and IL2 production in Jurkat cells (Rainio EM, Sandholm 
J, Koskinen PJ. (2002) Cutting edge: Transcriptional activity of NFATcl is enhanced by 
the PIM-1 kinase. J. Immunol 168:1524-1527). This indicates a novel phosphorylation 
dependant regulatory mechanism targeting NFATcl through which PIM-1 acts as down 
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Stream effector of ras to facilitate IL2 dependant proliferation and survival of lymphoid cells 
(M). 

[0008] PIM-1 is shown to interact with many other targets. Phosphorylation of Cdc25A 
phosphatase, a direct transcriptional target of c-myc, increase its phosphatase activity both 
in-vivo and in-vitro indicating that Cdc25A link PIM-1 and c-myc in cell transformation and 
apoptosis (Mochizuki T, Kitanaka C, Noguchi K, Muramatsu T, Asai A, and Kuchino Y. 
(1999) Physical and functional interactions between PIM-1 kinase and Cdc25A 
phosphatase. Implications for the PIM-1 -mediated activation of the c-Myc signaling 
pathway; J. Biol. Chem, 274:18659-18666). PIM-1 also phosphorylate PTP-U2S, a tyrosine 
phosphatase associated with differentiation and apoptosis in myeloid cells, decreasing its 
phosphatase activity and hence preventing premature onset of apoptosis following PMA- 
induced differentiation (Wang et al. (2001) Pim-1 negatively regulates the activity of PTP- 
U2S phosphatase and influences terminal differentiation and apoptosis of monoblastoid 
leukemia cells. Arch. Biochem. Biophys. 390:9-18). The phosphoiylation of pi 00, a co- 
activator of c-myb (Weston, 1999, Reassessing the role of C-lVTi'B in tumorigenesis. 
Oncogene 18:3034-3038), by ^BM-l is involved in Ras-dependent regulation of 
transcription (Leverson JD, Kc ?kix^en PJ, Onico FC, Rainio EM, Jalkanen KJ, Dash AB, 
Eisenman RN, and Ness SA. (1998) PIM-l kinase and pi 00 cooperate to enhance c-Myb 
activity. Afo/. Cell 2: 417-425). The phtusphorylation of another PIM-1 target, 
heterochromatin protein 1(HP1) has \a?av. shown to be involved in transcription repression 
(Koike N, Maita H, Taira T, Ariga H, IgucM-Ariga SM. (2000) Identification of 
heterochromatin protein 1 (HPl) as a phosphorylation target by PIM-1 kinase and the effect 
of phosphorylation on the transcriptional repression function of HP-1 (1), FEBSLetL 467: 
17-21). 

[0009] The information provided above is intended solely to assist the understanding of 
the reader. None of the information provided or references cited is admitted to be prior art 
to the present invention. 

SUMMARY OF THE INVENTION 

[0010] The present invention concerns molecular scaffolds that can be used to identify 
and develop ligands active on one or more kinases, for example, the PIM kinases, {e.g., 
PDvl-l, PIM-2, and PIM-3). Compounds representing the present molecular scaffolds have 
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been co-crystallized with PlM-1, and the co-crystal structures have been determined to 
confirm the orientation of the compound within the binding site. In addition, such 
compounds also bind to other kinases, such that the scaffolds can be used for ligand 
development for other kinases also. Li the description herein, PIM-1 and the use of 
molecular scaffolds and ligands with PIM-1 are described as examples, but the invention is 
not limited to PIM-1. 

[OOH] In a first aspect, the invention provides a kinase scaffold library comprising at least 
one set of compounds of a chemical structure selected from the group consisting of Formula 
I, II, m, IV, V, VI, and Vn as described herein. Formula I, H, and m as described in U.S. 
Appl. 10/664,421 and corresponding PCT/US03/29415, and Formula I as described in U.S. 
AppL 10/789,818 and corresponding PCT/US2004/005904, all of which are incorporated 
herein by reference in their entireties. (Unless specifically indicated to the contrary, 
reference to any of Formulas I-VU means the Formulas I-VH described with a generic 
structure herein.). In certain embodiments, the scaffold library contains at least one set of 
compounds having chemical structures of Formula I, H, in, IV, V, VI, or VII. 

[0012] Libraries with large numbers of compounds can be highly useful. In particular 
embodiments, a library includes at least 50, 100, 200, 300, 400, 500, 600, 800, 1000 1400, 
or even more different compounds of ttie particular chemical structure; a library can include 
a plurality of such sets of compounds of different chemical structures selected from the 
indicated Formulas; a plurality of sets of compounds of different chemical structure can 
include any combination of the specified chemical structures, e.g.. Formulas I and H, 
Formulas I and III, Formulas II and III, Formulas I, H, and HI (including each individual 
combination of the 1 1 Formulas listed above, taken 2, 3, 4, 5,6, 7, 8, 9, 10, or 1 1 at a time); 
a plurality of sets is 2, 3, 4, 5, 6, 7, 8, 9, 10, or 1 1, or more such sets; a majority of 
compounds in a set or sets have been demonstrated to bind to one or more kinases; one or 
more kinases are selected from the kinases including PIM-1, Pyk2, c-Abl, Her2, cMet, 
VEGFR, EGFR, cKit, Pkcp, p38, Cdk2, Akt, GskSp, or other kinase Hsted in Table 5; the 
kinase domain of the kinase has at least 30%, 35%, 40% or more sequence identity to PIM- 
1 kinase domain. 

[0013] As used herein in connection with the present compounds, the term "scaffold 
library'* refers to a defined set of compounds in a format suitable for testing as biochemical 
or biological activity modulators. For example, such compounds can be in solution or dry 
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in wells of a plate such as a microtiter plate (or a plurality of such plates). Such a library is 
distinguished from conventional compound libraries and commercially available compound 
libraries by being selected such that at least 50, 60, 70, 80, 90, 95, 98, or 100% of the 
compounds in the library are derivatives of the chemical structures that have been described 
herein as kinase binding compounds or molecular scaffolds. Such libraries can also include 
compounds that are derivatives of other kinase binding compounds. 

[0014] Because initial ligand identification can be carried out by fitting electronic 
representations of compounds in binding sites of target molecules, in another aspect the 
invention provides a system for fitting compounds in binding sites of one or more protein 
kinases. Such a system includes an electronic kinase scaffold library that includes at least 
one set of electronic representations of compounds of a chemical stmcture selected fi-om the 
group consisting of Formula 1, 11, in, IV, V, VI, and VII as described herein. Formula I, n, 
and III as described in U.S. Appl. 10/664,421 and corresponding PCT/US03/29415, and 
Formula I as described in U.S. Appl. 10/789,818 and corresponding PCT/US2004/005904, 
where tihe kinase scaffold library is embedded in a computer memory device, the electronic 
representations of the compounds can be selectively retrieved and fimctionally connected 
with computer software adapted to fit electronic representations of compounds in an 
electronic representation of a binding site of a kinase. The system can also include at least 
one electronic representation of a kinase binding site (e.g., an electronic representation of a 
crystal stracture of a kinase, kinase domain, or kinase binding site) embedded in computer 
memory such that the electronic representation of a kinase binding site can be functionally 
connected with the computer software. The system can include one or more electronic 
representations of binding sites of kinases selected from PIM-1, Pyk2, c-Abl, Her2, cMet, 
VEGFR, EGFR, cKit, Pkcp, p38, Cdk2, Akt, GskSp, or another kinase from Table 5., e.g., 
1, 2, 3, 4, 5, 6, 7, 8, 9, 10, or more such kinases. 

[0015] In a related aspect, the invention provides a method for obtaining improved ligands 
binding to a kinase (e.g., PlM-1, Pyk2, c-Abl, Her2, cMet, VEGFR, EGFR, cKit, Pkcp, p38, 
Cdk2, Akt, GskSp, or other kinase from Table 5), where the method involves determining 
whether a derivative of a compound of any of Formulas I, II, HI, IV, V, VI, or VII that binds 
to the kinase, binds to the kinase with greater affinity or greater specificity or both than the 
parent binding compound. Binding with greater affinity or greater specificity or both than 
the parent compound indicates that the derivative is an improved ligand. This process can 
also be carried out in successive rounds of selection and derivatization and/or with multiple 

SUBSTITUTE SHEET (RULE 26) 



wo 2005/028624 



7 



PCT/US2004/030360 



parent compounds to provide a compound or compoimds wilh improved ligand 
characteristics. Likewi^^^^^erivative compoxmds can be tested and selected to give high 
selectivity for the kinase, or to give cross-reactivity to a particular set of kinase targets. 

10016] In the context of the present invention, the terms "kinase" and "protein kinase" 
refers to an enzyme that phosphorylates other proteins. These enzymes are often grouped 
according to the amino acid that is phosphorylated, into tyrosine kinases, serine/threonine 
kinases, and histidine kinases. 

[0017] The term "PIM kinase" or "PIM family kinase" means a protein kinase with 
greater than 45% amino acid sequence identity to PIM-1 jfrom the same species, and 
includes PIM-1, PIM-2, and PIM-3. Unless clearly indicated to the contrary, use of the term 
"PIM kinase" constitutes a reference to any of the group of PIM kinases, specifically 
including individual reference to each of PIM-1, PIM-2, and PIM-3. 

[0018] As used herein, the terms "ligand" and "modulator" refer to a compound that 
modulates the activity of a target biomolecule, e.g:, an enzyme such as a kinase. Generally 
a ligand or modulator will be a small molecule, where "small molecule refers to a 
compound with a molecular weight of 1500 daltons or less, or preferably 1000 daltons or 
less, 800 daltons or less, or 600 daltons or less. Thus, an "improved ligand" is one that 
possesses better pharmacological and/or pharmacokinetic properties than a reference 
compound, where "better" can be defined by a person for a particular biological system or 
therapeutic use. 

[0019] In the context of binding compounds, molecular scaffolds, and ligands, the term 
"derivative" or "derivative compound" refers to a compound having a chemical structure 
that contains a common core chemical structure as a parent or reference compound, but 
differs by having at least one structural difference, e.g., by having one or more substituents 
added and/or removed and/or substituted, and/or by having one or more atoms substituted 
with different atoms. Unless clearly indicated to the contrary, the term "derivative" does 
not mean that the derivative is synthesized using the parent compound as a starting material 
or as an intermediate, although in some cases, the derivative may be synthesized from the 
parent. 
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[0020] Thus, the term "parent compound" refers to a reference compound for another 
compound, having structural features continued in the derivative compound. Often but not 
always, a parent compound has a simple chemical structure than the derivative. 

I0021J By "chemical structure" or "chemical substructure" is meant any definable atom or 
group of atoms that constitute a part of a molecule. Normally, chemical substructures of a 
scaffold or ligand can have a role in binding of the scaffold or ligand to a target molecule, or 
can influence the three-dimensional shape, electrostatic charge, and/or conformational 
properties of the scaffold or ligand. 

[0022] The temi "binds" in connection with the interaction between a target and a 
potential binding compound indicates that the potential binding compound associates with 
the target to a statistically significant degree as compared to association with proteins 
generally (z.e., non-specific binding). Thus, the term "binding compound" refers to a 
compound that has a statistically significant association with a target molecule. Preferably a 
binding compound interacts with a specified target with a dissociation constant (kd) of 1 
mM or less. A binding compound can bind with "low affinity", "very low affinity", 
"extremely low affinity", "moderate affinity", "moderately high affinity", or "high affinity" 
as described herein. 

[0023] In the context of compounds binding to a target, the term "greater affinity" 
indicates that the compound binds more tightly than a reference compound, or than the same 
compound in a reference condition, i.e., with a lower dissociation constant. In particular 
embodiments, the greater affinity is at least 2, 3, 4, 5, 8, 10, 50, 100, 200, 400, 500, 1000, or 
10,000-fold greater affinity. 

[0024] Also in the context of compounds binding to a biomolecular target, the term 
"greater specificity" indicates that a compound binds to a specified target to a greater extent 
than to another biomolecule or biomolecules that may be present imder relevant binding 
conditions, where binding to such other biomolecules produces a different biological 
activity than binding to the specified target. Typically, the specificity is with reference to a 
limited set of other biomolecules, e.g., other kinases or even other type of enzymes. In 
particular embodiments, the greater specificity is at least 2, 3, 4, 5, 8, 10, 50, 100, 200, 400, 
500, or 1000-fold greater specificity. 
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[00251 As used in connection with binding of a compound with a particular kinase, the 
term "interact" indicates that the distance from a bound compoxmd to a particular amino 
acid residue will be 5.0 angstroms or less, or 6 angstroms or less with one water molecule 
coordinated between the compound and the residue, or 9 angstroms or less with two water 
molecules coordinated between the compound and the residue, hi particular embodiments, 
the distaace from the compound to the particular amino acid residue is 4.5 angstroms or 
less, 4.0 angstroms or less, or 3.5 angstroms or less. Such distances caa be determined, for 
example, using co-crystallography, or estimated using computer fitting of a compound in 
the kinase active site. 

[0026] Reference to particular amino acid residues in PIM-1 polypeptide residue number 
is defined by the numbering provided in Meeker, T. C, Nagarajan, L., ar-Rushdi, A., 
Rovera, G., Huebner, K., Corce, C. M.; (1987) Characterization of the human PIM-1 gene: a 
putative proto-oncogene coding for a tissue specific member of the protein kinase family. 
Oncogene Res. 1 :87-101, in accordance with the sequence provided in SEQ ID NO: 1. 
PIM-2 is as described in Baytel et aL (1998) The human Pim-2 proto-oncogene and its 
testicular expression, Biochim. Biophys. Acta 1442,274-285. PIM-3 from rat is described in 
Feldman, et al. (1998) KJD-l, a protein kinase induced by depolarization in brain, J. Biol 
Chem. 273, 16535-16543; and Kinietzko et al. (1999) Pim kinase expression is induced by 
LTP stimulation and required for the consolidation of enduring LTP, EMBO J. 18, 3359- 
3369. (KID-1 is the same as PIM-3.) Human PIM-S nucleic acid and amino acid sequences 
are provided herein. 

[0027] hi a related aspect, the invention provides a mediod for developing ligands specific 
for a kinase, such as a PIM kinase, e.g., PIM-1, where the method involves determining 
whether a derivative of a compound that binds to a plurality of kinases has greater 
specificity for the particular kinase than the parent compoimd. 

[0028] As used herein in connection with binding compounds or ligands, the term 
"specific for a kinase", "specific for PIM-1" and terms of like import mean that a particular 
compound binds to the particular kinase to a statistically greater extent than to other kinases 
that may be present in a particular organism. Also, where biological activity other than 
binding is indicated, the term "specific for a kinase" indicates that a particular compound 
has greater biological activity associated with binding to the particular kinase than to other 
kinases. Preferably, the specificity is also with respect to other biomolecules (not limited to 
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kinases) that may be present from an organism. A particular compound may also be selected 
that is specific for kinase sub-group (e.g., tyrosine kinases, serine/threonine kinases, 
histidine kinases), indicating that it binds to and/or has a greater biological activity 
associated with binding to a plurality of kinases in that sub-group than to other kinases. 

[0029] In another aspect, the invention concerns a method for developing ligands binding 
to a particular kinase, e,g., PIM-1, where the method includes determining the orientation of 
at least one molecular scaffold of Formula I, II, HI, IV, V, VI, or VII in co-crystals with the 
kinase; identifying chemical structures of one or more of the molecular scaffolds, that, when 
modified, alter the binding affinity or binding specificity or both between the molecular 
scaffold and the kinase; and synthesizing a ligand in which one or more of the chemical 
structures of the molecular scaffold is modified to provide a ligand that binds to the kinase 
with altered binding affinity or binding specificity or both. Due to the significant of 
sequence identity between various kinases, e.g., PIM-1 and the other PIM kinases, PIM-1 
can also be used as a surrogate or in a homology model for orientation determination and to 
allow identification of chemical structures that can be modified to provide improved 
ligands. 

[0030] By "molecular scaffold" is meant a core molecule to which one or more additional 
chemical moieties can be covalently attached, modified, or eliminated to form a plurality of 
moleciiles with common stmctural elements. The moieties can include, but are not limited 
to, a halogen atom, a hydroxyl group, a methyl group, a nitro group, a carboxyl group, or 
any other type of molecular group including, but not limited to, those recited in this 
application. Molecular scaffolds bind to at least one target molecule, and the target 
molecule can preferably be a protein or enzyme. Preferred characteristics of a scaffold can 
include binding at a target molecule binding site such that one or more substituents on the 
scaffold are situated in binding pockets in the target molecule binding site; having 
chemically tractable stmctures that can be chemically modified, particularly by synthetic 
reactions, so that a combinatorial library can be easily constructed; having chemical 
positions where moieties can be attached that do not interfere with binding of the scaffold to 
a protein binding site, such that the scaffold or library members can be modified to achieve 
additional desirable characteristics, e.g., enabling the ligand to be actively transported into 
cells and/or to specific organs, or enabling the ligand to be attached to a chromatography 
colunm for additional analysis. 



SUBSTITUTE SHEET (RULE 26) 



wo 2005/028624 



11 



PCT/US2004/030360 



[0031 J By "binding site" is meant an area of a target molecule to which a ligand can bind 
non-covalently. Binding sites embody particular shapes and often contain multiple binding 
pockets present within the binding site. The particular shapes are often conserved within a 
class of molecules, such as a molecular family. Binding sites within a class also can contain 
conserved stractures such as, for example, chemical moieties, the presence of a binding 
pocket, and/or an electrostatic charge at the binding site or some portion of the binding site, 
all of which can influence the shape of the binding site. 

[0032] By "binding pocket" is meant a specific volume within a binding site. A binding 
pocket can often be a particular shape, indentation, or cavity in the binding site. Binding 
pockets can contain particular chemical groups or structures that are important in the non- 
covalent binding of another molecule such as, for example, groups that contribute to ionic, 
hydrogen bonding, or van der Waals interactions between the molecules. 

[0033] By "orientation", in reference to a binding compound bound to a target molecule is 
meant the spatial relationship of the binding compound and at least some of its constituent 
atoms to the binding pocket and/or atomg ( f the target molecule at least parti ally defining 
the binding pocket. 

[0034] By "co-crystals" is meant a complex of il^e compound, molecular scaffold, or 
ligand bound non-covalently to the target molecwb m\d present in a crystal form appropriate 
for analysis by X-ray or protein crystallography, hi j veferred embodiments the target 
molecule-ligand complex can be a protein-ligand complex. 

[0035] The phrase "alter the binding affinity or binding specificity" refers to changing the 
binding constant of a first compound for another, or changing the level of binding of a first 
compound for a second compound as compared to the level of binding of the first 
compound for third compoimds, respectively. For example, the binding specificity of a 
compound for a particular protein is increased if the relative level of binding to that 
particular protein is increased as compared to binding of the compound to unrelated 
proteins. 

[0036] As used herein in connection with test compounds, binding compounds, and 
modulators (ligands), the term "synthesizing" and like terms means chemical synthesis from 
one or more precursor materials. 
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[0037] The phrase "chemical structure of the molecular scaffold is modified" means that a 
derivative molecule has a chemical structure that differs from that of the molecular scaffold 
but still contains conunon core chemical structural features. The phrase does not necessarily 
mean that the molecular scaffold is used as a precursor in the synthesis of the derivative. 

[0038] By "assaying" is meant the creation of experimental conditions and the gathering 
of data regarding a particular result of the experimental conditions. For example, enzymes 
can be assayed based on their ability to act upon a detectable substrate. A compound or 
Ugand can be assayed based on its ability to bind to a particular target molecule or 
molecules. 

[0039] Certain compounds have been identified as molecular scaffolds and binding 
compounds for protein kinases, as exemplified by PIM-1. Thus, in another aspect, the 
invention provides a method for identifying a ligand binding to specific kinase, that includes 
determining whether a derivative compoxmd that includes a core structure selected firom the 
group consisting of the core stmctures of Formula I, Formula II, Formula III, Formula IV, 
Formula V, Formula VI, and Formula VII, as described herein binds to the kinase with 
altered binding affinity or specificity or both as compared to a parent compound. 

[0040J In reference to compounds of any of Formula I, II, III, IV, V, VI, and VII, the term 
"core structure" refers to the structures shown diagramatically as part of the description of 
compounds of each of Formulas I- VII, but excluding non-ring variable substituents. More 
generally, the term "core stmcture" refers to a characteristic chemical structure common to 
a set of compounds, especially a chemical stracture that carries variable substituents in the 
compoimd set. 

[0041] By a "set" of compounds is meant a collection of compounds. The compounds 
may or may not be structurally related. 

[0042] The invention further concerns co-crystals of a particular kinase and a kinase 
binding compound of Formula I, II, III, IV, V, VI, or VII. Advantageously, such co-crystals 
are of sufficient size and quality to allow structural determination to at least 3 Angstroms, 
2.5 Angstroms, or 2.0 Angstroms. The co-crystals can, for example, be in a crystallography 
plate, be mounted for X-ray crystallography and/or in au X-ray beam. Such co-crystals are 
beneficial, for example, for obtaining stmctural information concerning interaction between 
the kinase and kinase binding compounds. 
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[0043] Kinase binding compounds can include compounds that interact with at least one 
of PIM-1 residues 49, 52, 65, 67, 121, 128, and 186, or any 2, 3, 4, 5, 6, or 7 of those 
residues. Exemplary compounds that bind to PIM-1 include compounds of any of Formulas 

i-vn. 

[0044] Likewise, in additional aspects, methods for obtaining PIM-1 co-crystals with 
compounds of Formula I, II, III, IV, V, VI, and VII are provided. The method involves 
subjecting PIM-1 protein at 5-20 mg/ml to crystallization conditions substantially 
equivalent to Hampton Screen 1 conditions 2, 7, 14, 17, 23, 25, 29, 36, 44, or 49, in the 
presence of binding compoimd for a time sufficient for crystal development. The binding 
compound may be added at various concentrations depending on the nature of the 
compound, e.g., final concentration of 0.5 to 1.0 mM. In many cases, the binding 
compoimd will be in an organic solvent such as dimethyl sulfoxide solution. Exemplary co- 
crystallization conditions include 0.4-0.9 M sodium acetate trihydrate pH 6.5, 0.1 M 
imidazole; or 0.2-0.7 M. sodium potassium tartrate, 00.1 M MES buffer pH 6.5. 

[0045] In another aspect, provision of compoxmds active on a variety of different kinases 
(e.g., PIM-1) also provides a method for modulating kinase activity by contacting the kinase 
with a compound of any of Formulas I, II, HI, IV, V, VI, and Vn that binds to the kinase. 
The compoimd is preferably provided at a level sufficient to modulate the activity of PIM-1 
by at least 10%, more preferably at least 20%, 30%, 40%, or 50%. In many embodiments, 
the compoimd will be at a concentration of about 1 pM, 100 jaM, or 1 mM, or in a range of 
1-100 nM, 100-500 nM, 500-1000 nM, 1-100 joM, 100-500 joM, or 500-1000 ^M. The 
compound can be one that interacts with one more of PIM-1 residues 49, 52, 65, 67, 121, 
128, and 186. 

[0046] As used herein, the term "modulating" or "modulate" refers to an effect of altering 
a biological activity, especially a biological activity associated with a particular biomolecule 
such as PIM-1 . For example, an agonist or antagonist of a particular biomolecule modulates 
the activity of that biomolecule, e.g., an enzyme. 

[0047] The term "PIM-1 activity" refers to a biochemical activity of PIM-1, particularly 
including kinase activity. 

[0048] In the context of the use, testing, or screening of compounds that are or may be 
modulators, the term "contacting" means that the compound(s) are caused to be in sufficient 
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proximity to a particular molecule, complex, cell, tissue, organism, or other specified 
material that potential l^^^^^nteractions and/or chemical reaction (e.g., modulating 
enzymatic action) betweeri thi compotmd and other specified material can occur. 

[0049] In a related aspect, the invention provides a method for treating a patient suffering 
firom or at risk of a kinase-mediated disease or condition or a disease or condition in which 
kinase modulation provides a therapeutic benefit, such as a disease or condition 
characterized by abnormal kinase activity, e.g., PIM-1 activity, where the method involves 
administering to the patient a compound of Formula I, n. III, IV, V, VI, or VII. The 
compound can, for example, be one that interacts with one or more of PIM-1 residues 49, 
52, 65, 67, 121, 128, and 186. 

[0050} In certain embodiments, the disease or condition is a proliferative disease or 
neoplasia, such as benign or malignant tumors, psoriasis, leukemias (such as myeloblastic 
leukemia), lymphoma, prostate cancer, liver cancer, breast cancer, sarcoma, neuroblastoma, 
Wilm's tumor, bladder cancer, thyroid cancer, neoplasias of the epithelial origin such as 
mammacarcinoma, or a chronic inflammatory dis ^ase or condition, resulting, for example, 
fi"om a persistent infection (e.g:, tuberculosis, sypl iliS;, fungal infection), from prolonged 
exposure to endogenous (e,g., elevated plasma lipid 5) :r exogenous (eg;, silica, asbestos, 
cigarette tar, surgical sutures) toxins, and fi^om autoimoAOi e -©actions (e.g;, rheumatoid 
arthritis, systemic lupus erythrymatosis, multiple sclerosis ^isoriasis). Thus, chronic 
inflammatory diseases include many common medical condtions, such as rheiunatoid 
arthritis, restenosis, psoriasis, multiple sclerosis, surgical adhesions, tuberculosis, and 
chronic inflammatory lung and airway diseases, such as asthma pheumoconiosis, chronic 
obstmctive pulmonary disease, nasal polyps, and pulmonary fibrosis. Kinase modulators 
may also be useful in inhibiting development of hematomous plaque and restinosis, in 
controlling restinosis, as anti-metastatic agents, in treating diabetic complications, as 
immunosuppressants, and in control of angiogenesis to the extent the kinase is involved in a 
particular disease or condition. 

[0051] In certain embodiments, the disease or condition is one listed in Table 5; the 
disease or condition is one listed in Table 5. In certain embodiments, the therapeutic or 
prophylactic effect of the compoimd is due to modulation of a kinase from Table 5; the 
therapeutic or prophylactic effect of the compound is due to modulation of a kinase from 
Table 5 and the disease or condition is one that corresponds thereto in Table 5. 
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[0052] As used herein, the term "kinase-mediated" disease or condition and like terms 
refer to a disease or condition in which the biological function of a kinase affects the 
development and/or course of the disease or condition, and/or in which modulation of a 
kinase alters the development, course, and/or symptoms of the disease or condition. 
Similarly, the phrase "kinase modulation provides a therapeutic benefit" indicates that 
modulation of the level of activity of a kinase in a subject indicates that such modulation 
reduces the severity and/or duration of the disease, reduces the likelihood or delays the 
onset of the disease or condition, and/or causes an improvement in one or more symptoms 
of the disease or condtion. Parallel terms apply to each of the kinases indicated herein. 

[0053] Because molecular scaffolds are described, a large number of different kinases can 
be used in connection with the described scaffolds and compounds. A Ust of such kinases 
(not intended to be comprehensive) is provided in Table 6. Exemplary kinases and a major 
indication for which modulation of the kinase is useful include the following: 



Table 6 




Tvrosine Kinases 


Indication 


c-abl 


CML (chronic myeloid leukemia) 


her2 


breast cancer 


c-met 


cancer 


VEGFR 


angiogenesis 


c-kit 


cancer 


Serine Kinases 




pkcp 


retinopathy 


p38 


inflammation/RA 


cdk2 


cancer 


akt 


cancer/ apoptosis 


gskSp 


diabetes 



[0054] As crystals of PlM-1 and other kinases have been developed and analyzed, another 
aspect concerns an electronic representation of the kinase with an electronic representation 
of a kinase binding compound or a test compound in the binding site, where the compound 
has a chemical stracture of Formula I, Formula H, Formula EI, Formula IV, Formula V, 
Formula VI, or Formula VII. 

[0055] Likewise, in a related aspect, the invention concerns an electronic representation of 
a portion of a kinase binding site, e.g., PIM-l, (which can be an active site), which includes 
a representation of Formula I, II, III, IV, V, VI, or VH. A binding site can be represented in 
various ways, e.g., as representations of atomic coordinates of residues around the binding 
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site and/or as a binding site surface contour, and can include representations of the binding 
character of particular residues at the binding site, e.g., conserved residues. 

[0056] In another aspect, the invention provides a method for identifying potential kinase, 
e.g.^ PIM-1 or other kinase listed herein, binding compounds by fitting at least one 
electronic representation of a compound of Formula 1, 11, III, IV, V, VI, or VII in an 
electronic representation of a kinase, e.g., PIM-1, binding site. The representation of the 
binding site may be part of an electronic representation of a larger portion(s) (e.g., kinase 
domain) or all of a PIM molecule or may be a representation of only the binding site. The 
electronic representation may be as described above or otherwise described herein. 

[0057] In particular embodiments, the method involves fitting a computer representation 
of a compound firom a computer database with a computer representation of the active site 
of a kinase, e.g,, PIM-1; and involves removing a computer representation of a compound 
complexed with the kinase molecule and identifying compounds that best fit the active site 
based on favorable geometric fit and energetically favorable complementary interactions as 
potential binding compounds. 

[0058] In other embodiments, the method involves modifying a computer representation 
of a compound complexed with a kinase molecule, e,g, PIM-1, by the deletion or addition 
or both of one or more chemical groups; fitting a computer representation of a compound 
fi"om a computer database with a computer representation of the active site of the kinase 
molecule; and identifying compounds that best fit the active site based on favorable 
geometric fit and energetically favorable complementary interactions as potential binding 
compounds. 

[0059] In still other embodiments, the method involves removing a computer 
representation of a compound complexed with a kinase, such as PIM-1, and searching a 
database for compounds having stractural similarity to the complexed compound using a 
compound searching computer program or replacing portions of the complexed compound 
with similar chemical stmctures using a compoimd constraction computer program. 

[0060] Fitting a compound can include determining whether a compound will interact 
with one or more of PIM-1 residues 49, 52, 65, 67, 121, 128, and 186. 

[0061 ) In another aspect, the invention concerns a method for attaching a kinase binding 
compound of Formula I, II, HI, IV, V, VI, or VII (e.g., a PIM-1 binding compound) to an 
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attachment component, as well as a method for identifying attachment sites on such kinase 
binding compound. Tt^metiiod involves identifying energetically allowed sites for 
attachment of an attachment component; and attaching the compound or a derivative thereof 
to the attachment component at the energetically allowed site. In certain embodiments, the 
kinase is a kinase listed herein; the kinase has at least 25% amino acid sequence identity or 
30% sequence similarity to wild type PIM-1, and/or includes conserved residues matching 
at least one of PIM-1 residues 49, 52, 65, 67, 121, 128, and 186 (i.e^, matching any one, any 
2, 3, 4, 5, 6, or 7 of those residues). 

[00621 Attachment components can include, for example, linkers (including traceless 
linkers) for attachment to a solid phase or to another molecule or other moiety. Such 
attachment can be formed by synthesizing the compound or derivative on the linker attached 
to a solid phase medium e.g., in a combinatorial synthesis in a plurality of compound. 
Likewise, the attachment to a solid phase medium can provide an affinity medium (e.g., for 
affinity chromatography). 

[0063] The attachment component can also include a label, which can be a directly 
detectable label such as a fluorophore, or an indirectly detectable such as a member of a 
specific binding pair, e.g., biotin. 

[0064] The ability to identify energetically allowed sites on a kinase binding compound of 
Formula I, II, III, IV, V, VI, and VII, e.g., a PIM-1 binding compound also, in a related 
aspect, provides modified binding compounds that have linkers attached, for example, 
compounds of Fomiula I- VII, preferably at an energetically allowed site for binding of the 
modified compound to a kinase. The linker can be attached to an attachment component as 
described above. 

[0065] Still another aspect of the invention concerns a method for developing a ligand for 
a kinase that includes conserved residues matching any one, 2, 3, 4, 5, 6, or 7 of PIM-l 
residues 49, 52, 65, 67, 121, 128, and 186, by determining whether a compound of Formula 
I, U, ni, rV, V, VI, or Vn binds to the kinase. The method can also include detennimng 
whether the compound modulates the activity of the kinase. In certain embodiments, the 
kinase has at least 25% sequence identity or at least 30% sequence similarity to PIM-1, or 
PIM-1 kinase domain. 
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[0066] In particular embodiments, the determining includes computer fitting the 
compound in a binding site of the kinase and/or the method includes forming a co-crystal of 
the kinase and the compound. Such co-crystals can be used for determining the binding 
orientation of the compound with the kinase and/or provide structural information on the 
kinase, e.g., on the binding site and interacting amino acid residues. Such binding 
orientation and/or other structural information can be accomplished using X-ray 
crystallography. 

[0067] The invention also provides compounds of Formula 1, 11, EI, IV, V, VI, and Vn 
that bind to and/or modulate {e.g.^ inhibit) kinase activity for a particular kinase, e.g., PIM- 
1 . Accordingly, in aspects and embodiments involving kinase binding compounds, 
molecular scaffolds, and ligands or modulators, the compound is a weak binding compoimd; 
a moderate binding compound; a strong binding compound; a compound that binds at a 
level identified herein; the compound interacts with one or more of PIM-1 residues 49, 52, 
65, 67, 121, 128, and 186; the compoimd is a small molecule; the compound binds to a 
plurality of different kinases (e.g.y at least 5, 10, 15, 20 different kinases). In particular 
embodiments, the invention concerns compounds of Formula I, Formula II, Formula HI, 
Formula IV, Formula V, Formula VI, and Formula VH as described below. 

[0068] In certain embodiments, the invention concerns compounds of Formula I: 



where, with reference to Formula I: 

[0069] is hydrogen, optionally substituted lower alkyl, optionally substituted lower 
alkenyl, optionally substituted lower alkynyl, optionally substituted cycloalkyl, optionally 
substituted heterocyclyoalkyl, optionally substituted aryl, optionally substituted aralkyl, 
optionally substituted heteroaryl, optionally substituted heteroaralkyl, -C(X)R^^, - 




Formula I 



C(X)NR 



-S(0)2R^', or-S(0)2NR'^R". 
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[0070] is hydrogen, halo, optionally substituted lower alkyl (e.g., trifluoromethyl), 
optionally substituted lower alkenyl, optionally substituted lower alkynyl, optionally 
substituted cycloalkyl, optionally substituted heterocycloalkyl, optionally substituted aryl, 
optionally substituted aralkyl, optionally substituted heteroaryl, optionally substituted 
heteroaralkyl, --C(X)NR^^R^^, -NR^V^ , or -S(0)2R^^ or -S(0)2NR^^^^ with the proviso 
that if is attached to nitrogen, it is not -NR^^R^^ 

[0071] R^, R\ R^ and R^ are independently hydrogen, halo, hydroxy, optionally 
substituted alkoxyl, optionally substituted thioalkoxy, optionally substituted amine, 
optionally substituted lower alkyl (e.g., trifluoromethyl), optionally substituted lower 
alkenyl, optionally substituted lower alkynyl, optionally substituted cycloalkyl, optionally 
substituted heterocycloalkyl, optionally substituted aryl, optionally substituted aralkyl, 
optionally substituted heteroaryl, or optionally substituted heteroaralkyl, -C(X)R^^, 
C(X)NR^^^^, S(0)2NR^^R^^, -NR^^R^^ , or -S(0)2R^^ 

[0072] R^^ and R^^ are independently hydrogen, optionally substituted lower alkyl, 
optionally substituted lower alkenyl, optionally substituted lower alkynyl, optionally 
substituted cycloalkyl, optionally substituted heterocycloalkyl, optionally substituted aryl, 
optionally substituted aralkyl, optionally substituted heteroaryl, optionally substituted 
heteroaralkyl, or R^^ and R^^ together form a 5-7 membered carbocyclic or heterocyclic 
ring; 

[0073] R^^ is hydroxy!, optionally substituted lower alkoxy, optionally substituted amine, 
optionally substituted lower alkyl, optionally substituted lower alkenyl, optionally 
substituted lower alkynyl, optionally substituted cycloalkyl, optionally substituted 
heterocycloalkyl, optionally substituted aryl, optionally substituted aralkyl, optionally 
substituted heteroaryl, or optionally substituted heteroaralkyl; 

[0074] R^^ is optionally substituted lower alkoxy, optionally substituted lower alkyl, 
optionally substituted lower alkenyl, optionally substituted lower alkynyl, optionally 
substituted cycloalkyl, optionally substituted heterocycloalkyl, optionally substituted aryl, 
optionally substituted aralkyl, optionally substituted heteroaryl, or optionally substituted 
heteroaralkyl; 
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[0075] R^^ and R^^ are independently hydrogen, optionally substituted lower alkyl, 
optionally substituted lower alkenyl, optionally substituted lower alkynyl, optionally 
substituted cycloalkyl, optionally substituted heterocycloalkyl, optionally substituted aryl, 
optionally substituted araUcyl, optionally substituted heteroaryl, optionally substituted 
heteroaralkyl, -C(X)R^", C(X)NR'^", or-S(0)2R^'; 

[0076] w, y, and z are independently O, S, N, or CR ; 

[00771 qisNorC; 

[0078] X == O or S; and 

(00791 n = 1 or 2. 

[0080] In particular embodiments of compounds of Formula I, with reference to Formula 
I, one or more of R\ R^ R^ R^ R\ R^ is H; any two of R^ R^ R^ R^ R^ R^ are H; any 3 
of R^ R^ R', R\ R^ R' are H; any 4 of R\ R', R', R', R', are H; any 5 of R^ R^ R^ R*, 
R^ R^ are H. Specification of the preceding subgroups is intended to expressly include each 
possible combination of the specified substituent groups. For example, in particular 
embodiments, R^ R^ R^ and R^ are H. 

[0081] Likewise, in certain embodiments the invaation concerns compounds of Formula 

n. 



Where, with reference to Formula 11: 

[0082J R' is hydrogen, halo, hydroxy, optionally substituted alkoxyl» optionally 
substituted thioaUcoxy, optionally substituted amine, optionally substituted lower alkyl (e.g., 
trifluoromethyl), optionally substituted lower alkenyl, optionally substituted lower alkynyl, 
optionally substituted cycloalkyl, optionally substituted heterocycloalkyl, optionally 
substituted aryl, optionally substituted aralkyl, optionally substituted heteroaryl, or 



a— b 




Formula II 
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optionally substituted heteroaralkyl, -C(X)R^ C(X)NR^R^ S(0)2NR V, -NR^R'^ , or - 
S(0)2R'; 

[0083] a, b, c, and d are independently O, S, NR^, or CR^* with the proviso that two of 
them are N (and not more than 2) and one (and not more than one) of them is either O or S, 
and the remaining one is CR^ ^; 

[0084] R^ is hydroxyl, optionally substituted lower alkoxy, optionally substituted amine, 
optionally substituted lower alkyl, optionally substituted lower alkenyl, optionally 
substituted lower alkynyl, optionally substituted cycloalkyl, optionally substituted 
heterocycloalkyl, optionally substituted aryl, optionally substituted aralkyl, optionally 
substituted heteroaryl, or optionally substituted heteroaralkyl; 

[0085] R^ and R"* are independently hydrogen, optionally substituted lower alkyl, 
optionally substituted lower alkenyl, optionally substituted lower alkynyl, optionally 
substituted cycloalkyl, optionally substituted heterocycloalkyl, optionally substituted aryl, 
optionally substituted aralkyl, optionally substituted heteroaryl, or optionally substituted 
heteroarallcyl; 

[0086] R^ is optionally substituted lower alkoxy, optionally substituted amine, optionally 
substituted lower-alkyl, optionally substituted lower alkenyl, optionally substituted lower 
alkynyl, optionally substituted cycloalkyl, optionally substituted heterocycloalkyl, 
optionally substituted aryl, optionally substituted aralkyl, optionally substituted heteroaryl, 
or optionally substituted heteroaralkyl; 

[0087] R" is hydroxy, optionally substituted alkoxyl, optionally substituted thioalkoxy, 
optionally substituted amine, optionally substituted lower alkyl (e.g., trifluoromethyl), 
optionally substituted lower alkenyl, optionally substituted lower alkynyl, optionally 
substituted cycloalkyl, optionally substituted heterocycloalkyl, optionally substituted aryl, 
optionally substituted aralkyl, optionally substituted heteroaryl, or optionally substituted 
heteroaralkyl, -C(X)R^ C(X)NR^R^ S(0)2NRV, -NR^R^ , or -S(0)2R^ 

[0088] hi particular embodiments of compounds of Formula H, with reference to Formula 
n, a & b, a & c, a& d, b & c, b & d, or c & d are N. In embodiments where a & b are N, c is 
S or O, or d is S or O; where a & c are N, b is S or O, or d is S or O; where a & d are N, b is 
S or O, or c is S or O; where b & c are N, a is S or O, or d is S or O; where b & d are N, a is 
S or O, or c is S or O, where c & d are N, a is S or O, or b is S or O. In particular 
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embodiments of each of the preceding designations of a, b, c, and d, ^ is optionally 
substituted hydrogen or halo; R*^ is trifluoromethyl, hydroxy, optionally substituted 
alkoxyl, optionally substituted thioalkoxy; R^^ is optionally substituted amine; R^^ is 
optionally substituted lower alkyl, optionally substituted lower alkenyl, optionally 
substituted lower alkynyl; R^' is optionally substituted cycloalkyl, optionally substituted 
heterocycloalkyl, optionally substituted aryl, optionally substituted aralkyl, optionally 
substituted heteroaryl, or optionally substituted heteroaralkyl; R'^ is -C(X)R^, QX^NR^R"^, 
S(0)2NR^R^ -NR^R^ , or -S(0)2R^ 

[0089] In other embodiments, the invention concerns compounds of Formula HI. 



Formula III 

where, with reference to Formula HI: 

[0090] R\ R^, and R^ are independently hydrogen, halo, hydroxy, optioi>o/y substituted 
alkoxyl, optionally substituted thioalkoxy, optionally substituted amine, option illy 
substituted lower alkyl (e.g., trifluoromethyl), optionally substituted lower ;aE<t tiyl, 
optionally substituted lower alkynyl, optionally substituted cycloalkyl, optionally 
substituted heterocycloalkyl, optionally substituted aryl, optionally substituted aralkyl, 
optionally substituted heteroaryl, or optionally substituted heteroaralkyl, -C(X)R'*, - 
C(X)NR^R^ ~S(0)2NR^R^ -NR^R^ , or -S(0)2R^; 

[0091] R"^ is hydroxyl, optionally substituted lower alkoxy, optionally substituted amine, 
optionally substituted lower alkyl, optionally substituted lower alkenyl, optionally 
substituted lower alkynyl, optionally substituted cycloalkyl, optionally substituted 
heterocycloalkyl, optionally substituted aryl, optionally substituted aralkyl, optionally 
substituted heteroaryl, or optionally substituted heteroaralkyl; 

[0092] R^ and R^ are independently hydrogen, optionally substituted lower alkyl, 
optionally substituted lower alkenyl, optionally substituted lower alkynyl, optionally 
substituted cycloalkyl, optionally substituted heterocycloalkyl, optionally substituted aryl. 
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Optionally substituted aralkyl, optionally substituted heteroaryl, optionally substituted 
heteroaralkyl, or and together for a 5-7 membered carbocyclic or heterocyclic ring; 

[0093] R^ is optionally substituted lower alkoxy, optionally substituted amine, optionally 
substituted lower alkyl, optionally substituted lower alkenyl, optionally substituted lower 
alkynyl, optionally substituted cycloalkyl, optionally substituted heterocycloalkyl, 
optionally substituted aryl, optionally substituted aralkyl, optionally substituted heteroaryl, 
or optionally substituted heteroaralkyl; 

[0094] X = O or S ; and n = 0 or 1 . 

[0095] In particular embodiments of compounds of Formula HI, with reference to 
Formula HI, X is O; X is S. In particular embodiments for each selection of X, R^ is H; R^ 
is H; R^ is H; R^ and R^ are H; R^ and R^ are H. In particular embodiments for each 
selection of X with each selection where one of R^ R^, and R^ is H, the other two of R^ R^ 
and R^ are independently halo, trifluoromethyl; the other two of R\ R^ and R"' are 
independently hydroxy, optionally substituted alkoxyl, optionally substituted thioalkoxy; 
the other two of R*, R^ and R^ are independently optionally substituted amine; the other two 
of R^, and R^ are independently optionally substituted lower alkyl, optionally substituted 
lower alkenyl, optionally substituted lower alkynyl, optionally substituted cycloalkyl, 
optionally substituted heterocycloalkyl, optionally substituted aryl, optionally substituted 
aralkyl, optionally substituted heteroaryl, or optionally substituted heteroaralkyl; the other 
two of R^ and R^ are independently -C(X)R\ -C(X)NR^R^ -S(0)2NR^R^, -NR^R^ , or 
— S(0)2R^. In particular embodiments, one of R^, R*^, and R^ is halo, trifluoromethyl; one of 
R^ R*^, and R'' is hydroxy, optionally substituted alkoxyl, optionally substituted thioalkoxy; 
one of R^ R^, and R^ is optionally substituted amine; one of R\ R^, and R"^ is optionally 
substituted lower alkyl, optionally substituted lower alkenyl, optionally substituted lower 
alkynyl, optionally substituted cycloalkyl, optionally substituted heterocycloalkyl, 
optionally substituted aryl, optionally substituted aralkyl, optionally substituted heteroaryl, 
or optionally substituted heteroaralkyl one of R^ R^, and R^ is -C(X)R^, -C(X)NR^R^ - 
S(0)2NR^R^ -NR^R^ , or ~S(0)2R^. 
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[0096] In other embodiments, the invention concerns compounds of Formula IV. 

^^^f^^ OH 



Formula IV 

where, with reference to Formula IV: 

[0097] is optionally substituted lower alkyl, optionally substituted lower alkenyl, 
optionally substituted lower alkynyl, optionally substituted cycloalkyl, optionally 
substituted heterocycloalkyl, optionally substituted aryl, optionally substituted aralkyl, 
optionally substituted heteroaryl, optionally substituted heteroaralkyl, ~NR^^^^, -OR^^ or 
-SR^^ 

[0098] R^ and R^ are independently hydrogen, optionally substituted lower alkyl, 
optionally substituted lower alkenyl, optionally substituted lower alkynyl, optionally 
substituted cycloalkyl, optionally substituted heterocycloalkyl, optionally substituted aryl, 
optionally substituted aralkyl, optionally substituted heteroaryl, optionally substituted 
heteroaralkyl, optionally substituted heteroaralkyl, ^(X)R^^ ^(X)NR^^^^, -<:(X)R^^, or 
"C(X)NR^^^^; 

[0099] R^^ and R^^ are independently hydrogen, optionally substituted lower alkyl, 
optionally substituted lower alkenyl, optionally substituted lower alkynyl, optionally 
substituted cycloalkyl, optionally substituted heterocycloalkyl, optionally substituted aryl, 
optionally substituted aralkyl, optionally substituted heteroaryl, optionally substituted 
heteroaralkyl, or R^^and R^^ together form a carbocyclic or heterocyclic ring; 

[0100] R^*^ is hydroxyl, optionally substituted lower alkoxy, optionally substituted amine, 
optionally substituted lower alkyl, optionally substituted lower alkenyl, optionally 
substituted lower alkynyl, optionally substituted cycloalkyl, optionally substituted 
heterocycloalkyl, optionally substituted aryl, optionally substituted aralkyl, optionally 
substituted heteroaryl, or optionally substituted heteroaralkyl; and 

[0101] R^^ is optionally substituted lower alkyl, optionally substituted lower alkenyl, 
optionally substituted lower alkynyl, optionally substituted cycloalkyl, optionally 
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substituted heterocycloalkyl, optionally substituted aryl, optionally substituted aralkyl, 
optionally substituted heteroaryl, or optionally substituted heteroaralkYl. 

[01021 In particular embodiments of compounds of Formula IV, with reference to 
Formula IV, is optionally substituted lower alkyl; is optionally substituted lower 
aJkenyl; R^ is optionally substituted lower alkynyl, optionally substituted cycloalkyi; R^ is 
optionally substituted heterocycloalkyl; R^ is optionally substituted aryl, optionally 
substituted aralkyl; R^ is optionally substituted heteroaryl, optionally substituted 
heteroaralkyl; R^ is -NR^ -OR^\ -SR^\ -C(X)R^^ or -C(X)NR^^R^^. In particular 
embodiments, R^ or R"^ (but not both) is hydrogen; R^ or R^ (but not the other) is optionally 
substituted lower alkyl, optionally substituted lower alkenyl, optionally substituted lower 
alkynyl, optionally substituted cycloalkyi; R^ or R^ (but not the other) is optionally 
substituted heterocycloalkyl; R^ or R^ (but not the other) is optionally substituted aryl, 
optionally substituted aralkyl; R^ or R*^ (but not the other) is optionally substituted 
heteroaryl, optionally substituted heteroaralkyl, or optionally substituted heteroaralkyl; R^ 
or R^ (but not both) is -C(X)R^^ or -C(X)NR^^'^. In particular embodiiTients, R^ is H; R^ 
isH;R^andR^ areH. 

[01031 The invention also concerns compounds of Formula V- 



Where, with reference to Formula V: 

[01041 R^ and R^ are independently hydrogen, hydroxyl, optionally substituted alkoxyl, 
optionally substituted thioalkoxy, optionally substituted lower alkyl, optionally substituted 
lower alkenyl, optionally substituted lower alkynyl, optionally substituted cycloalkyi, 
optionally substituted heterocycloalkyl, optionally substituted aryl, optionally substituted 
aralkyl, optionally substituted heteroaryl, optionally substituted heteroaralkyl, -NR^^^^ , - 
C(X)R^^ C(X)NR'^R'^, -S(0)2NR^ -S(0)2R^', or R^ and R^ when one of them is - 
NR^^^^, hydroxyl, alkoxyl, thioalkoxyl, aralkyl or heteroaralkyl and the other one is 




Formula V 
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hydrogen can combine to foim =NR'^, =0, =S, or =Caryl/heterDaryl, with the proviso that 
R' and both ctmnot be hydroxy!, alkoxyl, thioalkoxyl or -NR'^R" at the same time; 

10105] R^ is hydrogen, halo, optionally substituted lower alkyl, optionally substituted 
lower alkenyl, optionally substituted lower alkynyl, optionally substituted cycloalkyl, 
optionally substituted heterocycloalkyl, optionally substituted aiyl, optionally substituted 
aralkyl, optionally substituted heteroaryl, optionally substituted heteroaralkyl, optionally 
substituted heteroaralkyl, -C(X)R^^ or C(X)NR'^''; 

[0106] R^, R*, R^, and R^ are independently hydrogen, halo, hydroxyl, optionally 
substituted alkoxyl, optionally substituted thioalkoxy, optionally substituted amine, 
optionally substituted lower alkyl (e.g., trifluoromethyl), optionally substituted lower 
alkenyl, optionally substituted lower alkynyl, optionally substituted cycloalkyl, optionally 
substituted heterocycloalkyl, optionally substituted aryl, optionally substituted aralkyl, 

20 

optionally substituted heteroaryl, optionally substituted heteroaralkyl, -C(X)R , - 
C(X)m*^R", -S(0)2NR'^'^or -S(0)2R^^; 

[0107] R'^ and R" are independently hydrogen, optionally substituted lower alkyl, 
optionally substituted lower alkenyl, optionally substituted lower alkynyl, optionally 
substituted cycloalkyl, optionally substituted heterocycloalkyl, optionally substituted aryl, 
optionally substituted aralkyl, optionally substituted heteroaryl, optionally substituted 
heteroaralkyl, or R'^ and R*' together form a carbocyclic or heterocyclic ring; 

[0108] R^° is hydroxyl, optionally substituted lower alkoxy, optionally substituted amine, 
optionally substituted lower alkyl, optionally substituted lower alkenyl, optionally 
substituted lower alkynyl, optionally substituted cycloalkyl, optionally substituted 
heterocycloalkyl, optionally substituted aryl, optionally substituted aralkyl, optionally 
substituted heteroaryl, or optionally substituted heteroaralkyl; and 

[01 09] R^^ is optionally substituted amine, optionally substituted lower alkyl, optionally 
substituted lower alkenyl, optionally substituted lower alkynyl, optionally substituted 
cycloalkyl, optionally substituted heterocycloalkyl, optionally substituted aryl, optionally 
substituted aralkyl, optionally substituted heteroaryl, or optionally substituted heteroaralkyl. 

[01 10] In particular embodiments of compounds of Formula V, with reference to Formula 
V, R' is H; R^ is H; R^ is H; R'* is H; R^ is H; R^ is H; each combination of two of R', R^ 
R^, R*, R^, and R^ are H and the others; each combination of three of R', R^, R^, R**, R^, and 
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R*^ are H; each combination of 4 of R', R^, R^, R"*, R^, and R^ are H; each combination of 5 
of R', r\ r\ R^ r\ ar^S^. ^: 

\ 

[01 1 1 ] Likewise, the invention concerns compounds of Fonnula VI. 



Formula VI 

where, with reference to Fonnula VI: 

[0112] is hydrogen, optionally substituted lower alkyl, optionally substituted lower 
alkenyl, optionally substituted lower alkynyl, optionally substituted cycloalkyl, optionally 
substituted heterocycloalkyl, optionally substituted aryl, optionally substituted aralkyl, 
optionally substituted heteroaryl, optionally substituted heteroaralkyl, -C(X)R^^, - 
C(X)NR^^R'^ -S(0)2NR''ll^^, or -S(0)2R^^; 

[0113] R^ is hydrogen, halo, optionally substituted lower alkyl, optionally substituted 
lower alkenyl, optionally substituted lower alkynyl, optionally substituted cycloalkyl, 
optionally substituted heterocycloalkyl, optionally substituted aryl, optionally substituted 
aralkyl, optionally substituted heteroaryl, optionally substituted heteroaralkyl, optionally 
substituted heteroaralkyl, -C(X)R^^, ^C(X)NR^^'^ -S(0)2NR^^'^ or -S(0)2R^'; 

[01 14] R^ and R'* are independently hydrogen, halo, hydroxyl, optionally substituted 
alkoxyl, optionally substituted thioalkoxy, optionally substituted amine, optionally 
substituted lower alkyl (e.g., trifluoromethyl), optionally substituted lower alkenyl, 
optionally substituted lower alkynyl, optionally substituted cycloalkyl, optionally 
substituted heterocycloalkyl, optionally substituted aryl, optionally substituted aralkyl, 
optionally substituted heteroaryl, optionally substituted heteroaralkyl, -NR^^R^^ * -C(X)R^^, - 
C(X)NR^^R'^ -S(0)2NR^^R^^or -S(0)2R^^ or R^ and R^ when one of them is -«NR^^R^^ 
hydroxyl, alkoxyl, thioalkoxyl, aralkyl or heteroaralkyl and the other one is hydrogen can 
combine to form =NR^^, =0, =S, or =Caryl/heteroaryl, with the proviso that R^ and R^ both 
cannot be hydroxyl, alkoxyl, thioalkoxyl or -NR^^R^^ at the same time; 

[0115] R^^ and R^^ are independently hydrogen, optionally substituted lower alkyl, 
optionally substituted lower alkenvL ootionallv substituted lower alkvnvL optionally 
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substituted cycloalkyl, optionally substituted heterocycloalkyl, optionally substituted aryl, 
optionally substituted aralkyl, optionally substituted heteroaryl, optionally substituted 
heteroaralkyU or R^*^ and R^^ together form a carbocyclic or heterocyclic ring; 

[0116] R^^ is hydroxyU optionally substituted lower alkoxy, optionally substituted amine, 
optionally substituted lower alkyl, optionally substituted lower alkenyl, optionally 
substituted lower alkynyl, optionally substituted cycloalkyl, optionally substituted 
heterocycloalkyl, optionally substituted aryl, optionally substituted aralkyl, optionally 
substituted heteroaryl, or optionally substituted heteroaralkyl; and 

[0117] R^^ is optionally substituted lower alkoxy, optionally substituted amine, optionally 
substituted lower alkyl, optionally substituted lower alkenyl, optionally substituted lower 
alkynyl, optionally substituted cycloalkyl, optionally substituted heterocycloalkyl, 
optionally substituted aryl, optionally substituted aralkyl, optionally substituted heteroaryl, 
or optionally substituted heteroaralkyL 

[0118] In particular embodiments of compounds of Formula VI, with reference to 
Formula VI, R* is H; R^ is H; R^ is H; R^ and R^ are H; R^ and R^ are H; R' is optionally 
substituted lower alkyl, optionally substituted lower alkenyl, optionally substituted lower 
alkynyl; R* is optionally substituted cycloalkyl, optionally substituted heterocycloalkyl; R^ 
is optionally substituted aryl, optionally substituted aralkyl, optionally substituted 
heteroaryl, optionally substituted heteroaralkyl; R^ is -C(X)R^°, -C(X)NR^^^'^, - 
S(0)2NR^ or -S(0)2R^^; R^ is halo; R^ is optionally substituted lower alkyl, optionally 
substituted lower alkenyl, optionally substituted lower alkynyl; R^ is optionally substituted 
cycloalkyl, optionally substituted heterocycloalkyl; R^ is optionally substituted aryl, 
optionally substituted aralkyl, optionally substituted heteroaryl, optionally substituted 
heteroaralkyl, or optionally substituted heteroaralkyl; R^ is •-C(X)R^°, -C(X)NR^^''^, - 
S(0)2NR^^^^ or -S(0)2R^^ R^ is halo; R^ is trifluoromethyl; R^ is optionally substituted 
alkoxyl, optionally substituted thioalkoxy; R^ is optionally substituted amine; R^ is 
optionally substituted lower alkyl, optionally substituted lower alkenyl, optionally 
substituted lower alkynyl; R^ is optionally substituted cycloalkyl, optionally substituted 
heterocycloalkyl; R^ is optionally substituted aryl, optionally substituted aralkyl, optionally 
substituted heteroaryl, or optionally substituted heteroaralkyl; R^ is -C(X)R^^, - 
C(X)NR^^^^, -S(0)2NR^^R' V-S(0)2R^^ 
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[01191 In yet other embodiments, the invention concerns compoimds of Formula VII. 



Formula VII 

where, with reference to Formula VII: 

[0120] is hydrogen, optionally substituted lower alkyl, optionally substituted lower 
alkenyl, optionally substituted lower alkynyl, optionally substituted cycloalkyl, optionally 
substituted heteralkyl ^optionally substituted heterocycloalkyl, optionally substituted aryl, 
optionally substituted araBcyl, optionally substituted heteroaryl, optionally substituted 
heteroaralkyl, -C(X)R^^, -CCX:)^^^R^ V^, -S(0)2NR^^'^or-S(0)2R^^; 

[0121] R^, R^, R\ R^, R^> atid R^ are independently hydrogen, halo, optionally substituted 
arvuntJ, optionally substituted alkoxy, optionally substituted thioether, optionally substituted 
Jo wex aJkyl, optionally substituted lower alkenyl, optionally substituted lower alkynyl, 
optiona .iy substituted cycloalkyl, optionally substituted heterocycloalkyl, optionally 
substi tited aryl, optionally substituted aralkyl, optionally substituted heteroaryl, optionally 
substituted heteroaralkyl, optionally substituted heteroaralkyl, -C(X)R^°, -C(X)NR^^R^^, - 
S(0)2NR'^R^^ or -S(0)2R^^ 

[0122] R^^ and R^^ are independently hydrogen, optionally substituted lower alkyl, 
optionally substituted lower alkenyl, optionally substituted lower alkynyl, optionally 
substituted cycloalkyl, optionally substituted heteroalkyl, optionally substituted 
heterocycloalkyl, optionally substituted aryl, optionally substituted aralkyl, optionally 
substituted heteroaryl, optionally substituted heteroaralkyl, or R^^ and R^^ together form a 
carbocyclic or heterocyclic ring; 

[0123] R^^ is hydroxy}, optionally substituted lower alkoxy, optionally substituted lower 
alkyl, optionally substituted lower alkenyl, optionally substituted lower alkynyl, optionally 
substituted cycloalkyl, optionally substituted heteroalkyl, optionally substituted 
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heterocycloalkyl, optionally substituted aryl, optionally substituted aralkyl, optionally 
substituted heteroaryl, or optionally substituted heteroaralkyl; and 

[0124] R^^ is optionally substituted lower alkoxy, optionally substituted lower alkyl, 
optionally substituted lower alkenyl, optionally substituted lower alkynyl, optionally 
substituted cycloalkyl, optionally substituted heteroalkyl, optionally substituted 
heterocycloalkyl, optionally substituted aryl, optionally substituted aralkyl, optionally 
substituted heteroaryl, or optionally substituted heteroaralkyl. 

[01251 In particular embodiments of compoimds of Formula VII, with reference to 
Fonnula VH, is H; R^ is H; R^ is H; R^ is H; R^ is H; R^ is H; R^ is H; each combination 
of two of R^ R^, R^ R^ R^ R^ and R*^ are H; each combination of three of R\ R^ R^, R^, 
R^ R^, and R^ are H; each combination of 4 of R^ R^ R^ R^, R^ R^, and R^ are H; each 
combination of 5 of R^ R^, R^ R"^, R^, R^, and R^ are H; each combination of six of R^ R^, 
R^ R^ R^ R^, and are H (e.g., all except for R^ and R^). 

[0126] As used in connection with the present invention, unless indicated clearly to the 
contrary, the description of compounds and the use of such compounds, e.g., compounds of 
any of Formula I-Vn, includes pharmaceutically acceptable salts and solvates of such 
compounds. 

[0127] In connection with the compounds of Formulas I- VII, the following definitions 
apply. 

[0128] "Halo" or "Halogen" - alone or in combination means all halogens, that is, chloro 
(CI), fluoro (F), bromo (Br), iodo (I). 

[0129] "Hydroxyl" refers to the group -OH. 

[0130] "Thiol" or "mercapto" refers to the group -SH. 

[0131] "Alkyl" - alone or in combination means an alkane-derived radical containing 
firom 1 to 20, preferably 1 to 15, carbon atoms (unless specifically defined). It is a straight 
chain alkyl, branched alkyl or cycloalkyL Preferably, straight or branched alkyl groups 
containing firom 1-15, more preferably 1 to 8, even more preferably 1-6, yet more preferably 
1-4 and most preferably 1-2, carbon atoms, such as methyl, ethyl, propyl, isopropyl, butyl, t- 
butyl and the hke. The term "lower alkyl" is used herein to describe the straight chain alkyl 
groups of 1-6, 1-4, or 1-2 carbon atoms. Preferably, cycloalkvl erouns are monocvcHc. 
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bicyclic or tricyclic ring systems of 3-8, more preferably 3-6, ring members per ring, such 
as cyclopropyl, cyclopentyl, cyclohexyl, and the like, but can also include larger ring 
structures such as adamantyl. Alkyl also includes a straight chain or branched alkyl group 
that contains or is interrupted by a cycloalkyl portion. The straight chain or branched alkyl 
group is attached at any available point to produce a stable compound. Examples of this 
include, but are not limited to, 4-(isopropyl)-cyclohexylethyl or 2-methyl- 
cyclopropylpentyl. A substituted alkyl is a straight chain alkyl, branched alkyl, or 
cycloalkyl group defined previously, independently substituted with 1 to 3 groups or 
substituents of halo, hydroxy, alkoxy, alkylthio, alkylsulfinyl, alkylsulfonyl, acyloxy, 
aryloxy, heteroaryloxy, amino optionally mono- or di-substituted with alkyl, aryl or 
heteroaryl groups, amidino, urea optionally substituted with alkyl, aryl, heteroaryl or 
heterocyclyl groups, aminosulfonyl optionally N-mono- or N,N-di-substituted with alkyl, 
aryl or heteroaryl groups, alkylsulfonylamino, arylsulfonylamino, heteroarylsulfonylamino, 
alkylcarbonylamino, arylcarbonylamino, heteroarylcarbonylamino, or the like. 

[0132] "Alkenyl'* - alone or in combination means a straight, branched, or cyclic 
hydrocarbon containing 2-20, preferably 2-17, more preferably 2-10, even more preferably 
2-8, most preferably 2-4, carbon atoms and at least one, preferably 1-3, more preferably 1-2, 
most preferably one, carbon to carbon double bond. In the case of a cycloalkyl group, 
conjugation of more than one carbon to carbon double bond is not such as to confer 
aromaticity to the ring. Carbon to carbon double bonds may be either contained within a . 
cycloalkyl portion, with the exception of cyclopropyl, or within a straight chain or branched 
portion. Examples of alkenyl groups include ethenyl, propenyl, isopropenyl, butenyl, 
cyclohexenyl, cyclohexenylalkyl and the like. A substituted alkenyl is the straight chain 
alkenyl, branched alkenyl or cycloalkenyl group defined previously, independently 
substituted with 1 to 3 groups or substituents of halo, hydroxy, alkoxy, alkylthio, 
alkylsulfinyl, alkylsulfonyl, acyloxy, aryloxy, heteroaryloxy, amino optionally mono- or di- 
substituted with alkyl, aryl or heteroaryl groups, amidino, urea optionally substituted with 
alkyl, aryl, heteroaryl or heterocyclyl groups, aminosulfonyl optionally N-mono- or N,N-di- 
substituted with alkyl, aryl or heteroaryl groups, alkylsulfonylamino, arylsulfonylamino, 
heteroarylsulfonylamino, alkylcarbonylamino, arylcarbonylamino, 
heteroarylcarbonylamino, carboxy, alkoxycarbonyl, aryloxycarbonyl, 
heteroaryloxycarbonyl, or the like attached at any available point to produce a stable 
compound. 
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[0133] "Alkyny?' - alone or in combination means a straight or branched hydrocarbon 
containing 2-20, preferably 2-17, more preferably 2-10, even more preferably 2-8, most 
preferably 2-4, carbon atoms containing at least one, preferably one, carbon to carbon triple 
bond. Examples of alkynyl groups include ethynyl, propynyl, butynyl and the like. A 
substituted alkynyl refers to a straight chain alkynyl or branched alkynyl, independently 
substituted with 1 to 3 groups or substituents of halo, hydroxy, alkoxy, alkylthio, 
alkylsulfinyl, alkylsulfonyl, acyloxy, aryloxy, heteroaryloxy, amino optionally mono- or di- 
substituted with alkyl, aryl or heteroaryl groups, amidino, urea optionally substituted with 
alkyl, aryl, heteroaryl or heterocyclyl groups, aminosulfonyl optionally N-mono- or N,N-di- 
substituted with alkyl, aryl or heteroaryl groups, alkylsulfonylamino, arylsulfonylamino, 
heteroarylsulfonylamino, alkylcarbonylamino, arylcarbonylamino, 
heteroarylcarbonylamino, or the like attached at any available point to produce a stable 
compound. 

[0134] '^Alkyl alkenyl" refers to a groTjip -R-CR'=CR*" R"", where R is lower alkylene, 
or s^abstituted lower alkylene, R\ R"\ R"" may mdependently be hydrogen, halogen, lower 
alk I, substituted lower alkyl, acyl, aryl, substituted aryl, heteroaryl, or substituted 
hete. oaryl as defined below. 

[01351 yl alkynyl" refers to a groups -RCCR' where R is lower alkylene or 

substituted louver alkylene, R' is hydrogen, lower alkyl, substituted lower alkyl, acyl, aryl, 
substituted aryl, heteroaryl, or substituted heteroaryl as defined below. 

[0136] '"Alkoxy" denotes the group -OR, where R is lower alkyl, substituted lower alkyl;, 
acyl, aryl, substituted aryl, aralkyi, substituted aralkyi, heteroalkyi, heteroarylalkyl, 
cycloalkyl, substituted cycloalkyl, cycloheteroalkyl, or substituted cycloheteroalkyl as 
defined. 

[0137] "Alkylthio" or "thioalkoxy" denotes the group -SR, -S(0)n=i.2-R, where R is lower 
alkyl, substituted lower alkyl, aryl, substituted aryl, aralkyi or substituted aralkyi as defined 
herein. 

[0138] "Acyl" denotes groups -C(0)R, where R is hydrogen, lower alkyl substituted 
lower alkyl, aryl, substituted aryl and the like as defined herein. 

[0139] "Aryloxy" denotes groups -OAr, where Ar is an aryl, substituted aryl, heteroaryl, 
or substituted heteroaryl group as defined herein. 
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10140] "Amino'' or substituted amine denotes the group NRR% where R and R' may 
independently by hydrogen, lower alkyl, substituted lower alkyl, aryl, substituted aryl, 
heteroaryl, or substituted heteroaryl as defined herein, acyl or sulfonyl. 

[0141] "Amido" denotes the group -C(0)NRR% where R and R' may independently by 
hydrogen, lower alkyl, substituted lower alkyl, aryl, substituted aryl, heteroaryl, substituted 
heteroaryl as defined herein. 

[0142] "Carboxyl" denotes the group -C(0)OR, where R is hydrogen, lower alkyl, 
substituted lower alkyl, aryl, substituted aryl, heteroaryl, and substituted heteroaryl as 
defined herein. 

[0143] "Carbocyclic" refers to a saturated, unsaturated, or aromatic group having a single 
ring (e.g., phenyl) or multiple condensed rings where all ring atoms are carbon atoms, 
which can optionally be unsubstituted or substituted with, e.g., halogen, lower alkyl, lower 
alkoxy, alkylthio, acetylene, amino, amido, carboxyl, hydroxy!, aryl, aryloxy, heterocycle, 
heteroaryl, substituted heteroaryl, nitro, cyano, thiol, sulfamido and the like. 

[0144] "Aryl" - alone or in combination means phenyl or naphthyl optionally carbocyclic 
fused with a cycloalkyl of preferably 5-7, more preferably 5-6, ring members and/or 
optionally substituted with 1 to 3 groups or substituents of halo, hydroxy, alkoxy, alkylthio, 
alkylsulfinyl, alkylsulfonyl, acyloxy, aryloxy, heteroaryloxy, amino optionally mono- or di- 
substituted with alkyl, aryl or heteroaryl groups, amidino, urea optionally substituted with 
alkyl, aryl, heteroaryl or heterocyclyl groups, aminosulfonyl optionally N-mono- or N,N-di- 
substituted with alkyl, aryl or heteroaryl groups, alkylsulfonylamino, arylsulfonylamino, 
heteroarylsulfonylamino, alkylcarbonylamino, arylcarbonylamino, 
heteroarylcarbonylanodno, or the like. 

[0145] "Substituted aryF' refers to aryl optionally substituted with one or more functional 
groups, e.g., halogen, lower alkyl, lower alkoxy, alkylthio, acetylene, amino, amido, 
carboxyl, hydroxyl, aryl, aryloxy, heterocycle, heteroaryl, substituted heteroaryl, nitro, 
cyano, thiol, sulfamido and the like. 

[0146] "Heteroaryl" - alone or in combination means a monocyclic aromatic ring 
structure containing 5 or 6 ring atoms, or a bicyclic aromatic group having 8 to 10 atoms, 
containing one or more, preferably 1-4, more preferably 1-3, even more preferably 1-2, 
heteroatoms independently selected from the group O, S, and N, and optionally substituted 
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with 1 to 3 groups or substituents of halo, hydroxy, alkoxy, alkylthio, alkylsulfinyl, 
alkylsulfonyl, acyloxy, aryloxy, hetero^juyloxy, amino optionally mono- or di-substituted 
with alkyl, aryl or heteroaryl groups, amidino, urea optionally substituted with alkyl, aryl, 
heteroaryl or heterocyclyl groups, aminosulfonyl optionally N-mono- or N^N-di-substituted 
with aDcyl, aryl or heteroaryl groups, alkylsulfonylamino, arylsulfonylamino, 
heteroarylsulfonylamino, alkylcarbonylamino, arylcarbonylamino, 

heteroarylcarbonylarnino, or the like. Heteroaryl is also intended to include oxidized S or 
N, such as sulfinyl, sulfonyl and N-oxide of a tertiary ring nitrogen. A carbon or nitrogen 
atom is the point of attachment of the heteroaryl ring structure such that a stable aromatic 
ring is retained. Examples of heteroaryl groups are pyridinyl, pyridazinyl, pyrazinyl, 
quinazolinyl, purinyl, indolyl, quinolinyl, pyrimidinyl, pyrrolyl, oxazolyl, thiazolyl, thienyl, 
isoxazolyl, oxathiadiazolyl, isothiazolyl, tetrazolyl, imidazolyl, triazinyl, furanyl, 
benzofuryl, indolyl and the like. A substituted heteroaryl contains a substituent attached at 
an available carbon or nitrogen to produce a stable compound. 

[01471 "Heterocyclyl" - alone or in combination means a non-aromatic cycloalkyl group 
having from 5 to 10 atoms in which from 1 to 3 carbon atoms in the ring are replaced by 
heteroatoms of O, S or N, and are optionally benzo fused or fused heteroaryl of 5-6 ritig 
members and/or are optionally substituted as in the case of cycloalkyl. Heterocycyl is also 
intended to include oxidized S or N, such as sulfinyl, sulfonyl and N-oxide of a tertiary ring 
nitrogen. The point of attachment is at a carbon or nitrogen atom. Examples of 
heterocyclyl groups are tetrahydrofuranyl, dihydropyridinyl, piperidinyl, pyrrolidinyl, 
piperazinyl, dihydrobenzoforyl, dihydroindolyl, and the like. A substituted hetercyclyl 
contains a substituent nitrogen attached at an available carbon or nitrogen to produce a 
stable compound. 

[0148] "Heterocycle" refers to a saturated, unsaturated, or aromatic carbocyclic group 
having a single ring (e.g., morpholino, pyridyl or furyl) or multiple condensed rings (e.g., 
naphthpyridyl, quinoxalyl, quinolinyl, indolizinyl or benzo[b]thienyl) and having at least 
one hetero atom, such as N, O or S, within the ring, which can optionally be unsubstituted 
or substituted with, e.g., halogen, lower alkyl, lower alkoxy, alkylthio, acetylene, amino, 
amido, carboxyl, hydroxyl, aryl, aryloxy, heterocycle, heteroaryl, substituted heteroaryl, 
nitro, cyano, thiol, sulfamido and the like. 
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[0149] '^Substituted heteroaryr' refers to a heterocycle optionally mono or poly 
substituted with one or more functional groups, e.g., halogen, lower alkyl, lower alkoxy, 
alkylthio, acetylene, amino, amido, carboxyl, hydroxyl, aryl, aryloxy, heterocycle, 
substituted heterocycle, heteroaryl, substituted heteroaryl, nitro, cyano, thiol, sulfamido and 
the like- 

{01501 "Aralkyl" refers to the group -R-Ar where Ar is an aryl group and R is lower alkyl 
or substituted lower alkyl group. Aryl groups can optionally be unsubstituted or substituted 
with, e.g., halogen, lower alkyl, alkoxy, alkylthio, acetylene, amino, amido, carboxyl, 
hydroxyl, aryl, aryloxy, heterocycle, substituted heterocycle, heteroaryl, substituted 
heteroaryl, nitro, cyano, thiol, sulfamido and the like. 

[0151] "Heteroalkyl" refers to the group -R-Het where Het is a heterocycle group and R 
is a lower alkyl group. HeteroalkyI groups can optionally be unsubstituted or substituted 
with e.g., halogen, lower alkyl, lower alkoxy, alkylthio, acetylene, amino, amido, carboxyl, 
aryl, aryloxy, heterocycle, substituted heterocycle, heteroaryl, substituted heteroaryl, nitro, 
cyanp, thiol, sulfamido and the like. 

[0152] "Heteroarylalkyr' refers to the group -R-HetAr where HetAr is an heteroaryl 
group and R lower alkyl or substituted lower alkyl. Heteroarylalkyl groups can optionally 
be unsubstituted or substituted with, e.g., halogen, lower alkyl, substituted lower alkyl, 
alkoxy, alkylthio, acetylene, aryl, aryloxy, heterocycle, substituted heterocycle, heteroaryl, 
substituted heteroaryl, nitro, cyano, thiol, sulfamido and the like. 

[0153] "Cycloalkyl" refers to a cyclic or polycyclic alkyl group containing 3 to 15 carbon 
atoms. 

{01541 "Substituted cycloalkyr' refers to a cycloalkyl group comprising one or more 
substituents with, e.g., halogen, lower alkyl, substituted lower alkyl, alkoxy, alkylthio, 
acetylene, aryl, aryloxy, heterocycle, substituted heterocycle, heteroaryl, substituted 
heteroaryl, nitro, cyano, thiol, sulfamido and the like. 

[0155] "Cycloheteroalkyl" refers to a cycloalkyl group wherein one or more of the ring 
carbon atoms is replaced with a heteroatom (e.g., N, O, S or P). 

[0156] "Substituted cycloheteroalkyl" refers to a cycloheteroalkyl group as herein defined 
which contains one or more substituents, such as halogen, lower alkyl, lower alkoxy. 
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alkylthio, acetylene, amino, amido, carboxyl, hydroxyl, aryl, aryloxy, heterocycle, 
substituted heterocycle, heteroaryl, substituted heteroaryl, nitro, cyano^ thioU sulfamido and 
the like. 

[0157] "Alkyl cycloalkyl" denotes the group -R-cycloalkyl where cycloalkyl is a 
cycloalkyl group and R is a lower alkyl or substituted lower alkyl. Cycloalkyl groups can 
optionally be unsubstituted or substituted with e.g. halogen, lower alkyl, lower alkoxy, 
alkylthio, acetylene, amino, amido, carboxyl, hydroxyl, aryl, aryloxy, heterocycle, 
substituted heterocycle, heteroaryl, substituted heteroaryl, nitro, cyano, thiol, sulfamido and 
the like. 

[0158] "Alkyl cycloheteroalkyl" denotes the group -R-cycloheteroalkyl where R is a 
lower alkyl or substituted lower alkyl. Cycloheteroalkyl groups can optionally be 
unsubstituted or substituted with e.g. halogen, lower alkyl, lower alkoxy, alkylthio, amino, 
amido, carboxyl, acetylene, hydroxyl, aryl, aryloxy, heterocycle, substituted heterocycle, 
heteroaryl, substituted heteroaryl, nitro, cyano, thiol, sulfamido and the like. 

[0159] An additional aspect of this invention relates to pharmaceutical formulations or 
compositions, that include a therapeutically effective amount of a compound of Formula I, 
n. III, IV, V, VI, or VII, (or a compound within a sub-group of compounds within any of 
those generic formulas) and at least one pharmaceutically acceptable carrier or excipient. 
The composition can include a plurality of different pharmacalogicaily active compoimds. 

[0160] As used herein, the term "pharmaceutical composition" refers to a preparation that 
includes a therapeutically significant quantity of an active agent, that is prepared in a form 
adapted for administration to a subject. Thus, the preparation does not include any 
component or components in such quantity that a reasonably pmdent medical practitioner 
would find the preparation unsuitable for administration to a normal subject. In many cases, 
such a pharmaceutical composition is a sterile preparation. 

[0161] In a related aspect, the invention provides kits that include a pharmaceutical 
composition as described herein. In particular embodiments, the pharmaceutical 
composition is packaged, e.g., in a vial, bottle, flask, which may be further packaged, e.g., 
within a box, envelope, or bag; the pharmaceutical composition is approved by the U.S. 
Food and Drug Administration or similar regulatory agency for administration to a 
mammal,e.g-, a human; the pharmaceutical composition is approved for administration to a 
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mammal, e.g., a human for a kinase-mediated disease or condition; the kit includes written 
instructions or other in4jg^^that the composition is suitable or approved for 
administration to a mamm'air^-g-. a human, for a kinase-mediated disease or condition; the 
pharmaceutical composition is packaged in unit does or single dose form, e.g., single dose 
pills, capsules, or the Hke. 

[0162] In another related aspect, compoxmds of any of Formulas I-VII can be used in the 
preparation of a medicament for the treatment of a kinase-mediated disease or condition or a 
disease or condition in which modulation of a kinase provides a therapeutic benefit. 

[0163] In the present context, the term ^therapeutically effective" indicates that the 
materials or amount of material is effective to prevent, alleviate, or ameliorate one or more 
symptoms of a disease or medical condition, and/or to prolong the survival of the subject 
being treated. 

[01 64] The term "pharmaceutically acceptable" indicates that the indicated material does 
not have properties that would cause a reasonably prudent medical practitioner to avoid 
administration of the material to a patient, taking into consideration the disease or 
conditions to be treated and the respective route of administration. For example, it is 
commonly required that such a material be essentially sterile, e.g., for injectibles. 

[01 65J "A pharmaceutically acceptable salt" is intended to mean a salt that retains the 
biological effectiveness of the free acids and bases of the specified compound and that is not 
biologically or otherwise unacceptable. A compoimd of the invention may possess a 
sufficiently acidic, a sufficiently basic, or both functional groups, and accordingly react with 
any of a number of inorganic or organic bases, and inorganic and organic acids, to form a 
pharmaceutically acceptable salt. Exemplary pharmaceutically acceptable salts include 
those salts prepared by reaction of the compoxmds of the present invention with a mineral or 
organic acid or an inorganic base, such as salts including sodium, chloride, sulfates, 
pyrosulfates, bisulfates, sulfites, bisulfites, phosphates, monohydrogenphosphates, 
dihydrogenphosphates, metaphosphates, pyrophosphates, chlorides, bromides, iodides, 
acetates, propionates, decanoates, caprylates, acrylates, formates, isobutyrates, caproates, 
heptanoates, propiolates, oxalates, malonates, succinates, suberates, sebacates, fumarates, 
maleates, butyne-1,4 dioates, hexyne-l,6-dioates, benzoates, chlorobenzoates, 
methylbenzoates, dinitrobenzoates, hydroxybenzoates, methoxybenzoates, phthalates, 
sulfonates, xylenesulfonates, phenylacetates, phenylpropionates, phenylbutyrates, citrates, 
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lactates, . gamma. -hydroxybutyrates, glycollates, tartrates, methane-sulfonates, 
propanesulfonates, naphthalene-1 -sulfonates, naphthalene-2-sulfonates, and mandelates. 

[01 66] The term '^pharmaceutically acceptable metabolite" refers to a pharmacologically 
acceptable product, which may be an active product, produced through metabolism of a 
specified compound (or salt thereof) in the body of a subject or patient. Metabolites of a 
compound may be identified using routine techniques known in the art, and their activities 
determined using tests such as those described herein. For example, in some compounds, 
one or more alkoxy groups can be metabolized to hydroxyl groups while retaining 
phamiacologic activity and/or carboxyl groups can be esterified, e.g., glucuronidation. In 
some cases, there can be more than one metabolite, where an intermediate metabolite(s) is 
further metabolized to provide an active metabolite. For example, in some cases a 
derivative compound resulting from metabolic glucuronidation miay be inactive or of low 
activity, and can be further metabolized to provide an active metabolite. 

[0167] Additional aspects and embodiments will be apparent from the following Detailed 
Description and from the claims. 

BRIEF DESCRIPTION OF THE DRAWINGS 

[0168] FIGURE 1 shows a schematic representation of AMP-PNP in the binding site of 
PIM-1, showing conserved interacting residues. 

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS 
[0169] The Tables will first be briefly described. 

.[0170] Table 1 provides atomic coordinates for human PIM-1 . In this table and in Table 
4, the various columns have the following content, beginning with the left-most column: 
ATOM: Refers to the relevant moiety for the table row. 

Atom number: Refers to the arbitrary atom nimiber designation within the coordinate table. 
Atom Name: Identifier for the atom present at the particular coordinates. 
Chain ID: Cham ID refers to one monomer of the protein in the crystal, chain "A", or 
to other compound present in the crystal, e.g., HOH for water, and L for a ligand or binding 
compound. Multiple copies of the protein monomers will have different chain Ids. 
Residue Number: The amino acid residue number in the chain. 
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X, Y, Z: Respectively are the X, Y, and Z coordinate values. 

Occupancy; Describes the jfraction of time the atom is observed in the crystal. For 

example, occupancy = 1 means that the atom is present all the time; occupancy = o.5 

indicates that the atom is present in the location 50% of the time. 

B-factor: A measure of the thermal motion of the atom. 

Element: Identifier for the element- 

[01 71] Table 2 provides an alignment of catalytic domains of several PIM kinases, 
including human PIM-1, PIM-2, and PIM-3 as well as PIM kinases from other species. 

(01721 Table 2 provides an alignment of catalytic domains of several PIM kinases, 
including human PIM-1, PIM-2, and PTM-3 as well as PIM kinases from other species. 
Sequences from the following species are included in the alignment: Hs, Homo sapiens; 
Mm, Mus musculus; Dr, Danio rerio; XI, Xenopus laevis; Cc, Cotumix cotumix; and Ce, 
Caenorhabditis elegans. Residues with >90% and >75% conservations are in red and 
yellow background, respectively. Phosphate binding sites are indicated by puiple circles. 
Residues that are invariably involved in ligand binding are indicated by filled uparrows, 
whereas residues that can be involved in ligand binding are indicated by open uparrows. 
The backbone atoms of two residues (indicated by leftairows) in the hinge region have been 
shown to make hydrogen bonds to ligands in many known kinase/ligand complex 
structures. Note that PIM family kinases all have Pro as the second residue, resulting in 
theloss of a hydrogen bond donor. 

[0173] Table 4 provides atomic coordinates for PIM-1 with AMP-PNP in the binding site. 

(01741 Table 5 provides a list of kinases which have been correlated with diseases (or 
pathological condition). 

I. Introduction 

[0175] The present invention concerns the use of certain molecular scaffolds in the 
development of kinase modulators, e.g., kinase inhibitors. Such development can utilize 
kinase stmctures, for example, PIM kinase structures, structural information, and related 
compositions for identifying compounds that modulate kinase activity and for determining 
stmctures of other kinases. The following description utilizes PIM-1 for illustrative 
purposes. However, the invention is not limited to PIM-1; other protein kinases can also be 
utilized for modulation by the present compounds, and for developing additional modulators 

SUBSTITUTE SHEET (RULE 26) 



wo 2005/028624 PCT/US2004/030360 

40 

based on the present molecular scaffolds. The use of molecular scaffolds in connection with 
PDSd kinases and Pyk2 ^Ma^^^^^ Appl. 10/664,421 and corresponding 

PCT/US03/29415, and Fofniula I as described in U.S. Appl. 10/789,818 aad corresponding 
PCT/US2004/005904 respectively. Both of those applications are incorporated herein by 
reference in their entireties, including drawings. 

10176] Kinases, e.g., PIM-1 , are involved in a variety of disease conditions, and a number 
have been utilized as therapeutic targets. 

Exemplary Diseases Associated with Kinases 

(01 77] As indicated above, the present invention is exemplified in connection with PIM- 1 . 
As indicated in the Background above, PlM-1 functions as a weak oncogene. In transgenic 
mice with PIM-1 driven by Emu enhancer sequences, overexpression of PIM-1 by itself it 
does not lead to tumor formation, but does so in conjunction with overexpression of a 
second oncogenic gene. In 75% of tumors over-expressing PIM-l, the second gene found 
to be overexpressed v/?f> c-myc (van der Houven van Oordt CW, Schouten TG, van Krieken 
JH, van Dierendonck J. I, van der Eb AJ, Breuer ML,(1998) X-ray-induced 
lymphomagenesis in E i r^^-PB:!- 1 transgenic mice: an investigation of the co-operating 
molecular events. Carcinogencns 19:847-853). Other PIM kinases are also involved, as the 
functions of the various P'M kivi a ves appears to be at least partially complementary. 

[0178] Since PIM-1 is a protooicasene and it closely cooperates with other 
protooncogenes like c-myc in triggering intracellular signals leading to cell transformation, 
PIM-1 inhibitors have therapeutic applications in the treatment of various cancers, as wells 
as other disease states. Some examples are described below. 

Prostate cancer 

[01 79] A significant inter-relationship between PIM-1 and a disease state was reported in 
prostate cancer (Dhanasekaran et al. (2001) Delineation of prognostic biomarkers in 
prostate cancer. Nature 412: 822-826.) Using microarrays of complementary DNA, the gene 
expression profiles of approximately 10,000 genes from more than 50 noraial and neoplastic 
prostate cancer specimens and three corautnon prostate cancer cell lines were examined. Two 
of these genes, hepsin, a transmembrane serine protease, and PIM-1, a serine/threomne 
kinase are upregulated to several-fold. The PIM-1 kinase is strongly expressed in the 
cytoplasm of prostate cancer tissues while the normal tissues showed no or weak staining 

SUBSTITUTE SHEET (RULE 26) 



wo 2005/028624 PCT/US2004/030360 

41 

with anti-PIM-1 antibody (Id,) indicating PIM-1 is an appropriate target for drug 
development. 

Leukemia 

[0180] PIM-1 has been mapped to the 6p21 chromosomal region in humans. Nagarajan et 
al. (Nagarajan et al. (1986) Localization of the human pirn oncogene (PIM) to a region of 
chromosome 6 involved in translocations in acute leukemias. Proc. Natl Acad. Scu USA 
83:2556-2560) reported increased expression of PIM-1 in K562 erythroleukenaia cell lines 
which contain cytogenetically demonstrable rearrangement in the 6p21 region. A 
characteristic chromosome anomaly, a reciprocal translocation t(6;9)(p21;q33), has been 
described in myeloid leukemias that may be due to involvement of PBM-l. Amson et al. 
(1989) also observed overexpression in 30 % of myeloid and lymphoid acute leukemia. 
These studies also indicate a role for PIM-1 protooncogene during development and in 
deregulation in various leukemias. 

Kaposi Sarcoma 

10181] Analysis of gene expression profiles by microarrays in human hematopoietic cells 
after in vitro infection with human Herpes virus ( HHV 8), also known as Kaposi Sarcoma 
associated vims (KSHV)^ resulted in differential expression of 400 genes out of about 
10,000 analyzed. Of these four hundred genes, PIM-2 is upregulated more than 3.5 fold 
indicating PIM-2 as a potential target for therapeutic intervention. Thus, inhibitors selective 
to PIM-2 are of great therapeutic value in treating disease states mediated by HHV8 
(Mikovits et al. (2001) Potential cellular signatures of viral infections in human 
hematopoietic cells. Dis. Markers 17:173-178.) 

Asthma and Allergy. 

[0182] The increase in eosinophils at the site of antigen challenge has been used as 
evidence that eosinophils play a role in pathophysiology of asthma. Aberrant production of 
several different cytokines has been shown to result in eosinophilia. The cytokine IL-5 for 
example influences the development and maturation of eosinophils in a number of ways. 
Using microarray techniques, a role for PIM-1 in IL-5 signaling pathway in eosinophils was 
indicated. (Temple et al. (2001) Microarray analysis of eosinophils reveals a number of 
candidate survival and apoptosis genes. Am. J. Respir. CellMol Biol 25: 425-433.) Thus, 
inhibitors of PIM-1 can have therapeutic value in treatment of asthma and allergies. 
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Inflammation 

[01 83] PIM-l and/or the compounds described herein can also be useful for treatment of 
inflammation, either chronic or acute. Chronic inflammation is regarded as prolonged 
inflammation (weeks or months), involving simultaneous active inflammation, tissue 
destmction, and attempts at healing. (R.S. Cotran, V. Kumar, and S.L, Robbins, Saunders 
Co., (1989) Robbins Pathological Basis of Disease, p.75.) Although chronic inflammation 
can follow an acute inflammatory episode, it can also begin as a process that progresses 
over time, e.g., as a result of a chronic infection such as tuberculosis, syphilis, fungal 
infection which causes a delayed hypersensitivity reaction, prolonged exposure to 
endogenous or exogenous toxins, or autoimmune reactions (e^., rheumatoid arthritis, 
systemic lupus erythematosus, multiple sclerosis, psoriasis). Chronic inflammatory disease 
thus include many common medical conditions such as autoimmiine disorders such as those 
listed above, chronic infections, surgical adhesions, chronic inflammatory lung and airway 
diseases (e.g., asthma, pneumoconiosis, chronic obstmctive pulmonary disease, nasal 
polyps, and pulmonary fibrosis). For skin and airway inflammatory disease, topical or 
inhaled forms of drag administration can be used respectively. 

[0184] While kinase-related diseases and conditions are exemplified in connection with 
PIM-l, numerous other kinases have been correlated with particular disease and conditions, 
and are identified as target in a number of current treatments. Compounds derived from the 
present scaffolds can be developed to target such additional kinases for treating associated 
diseases and conditions. 

II. Crystalline Kinases 

[0185] In development of kinase modulators based on molecular scaffolds, crystalline 
kinases (e.g., human PIM-l) include native crystals, derivative crystals and co-crystals are 
useful. Native crystals generally comprise substantially pure polypeptides corresponding to 
the kinase in crystalline form. Crystal structures for a number of different kinases (or 
kinase domains) have been determined and are available for use in the present methods. 

[0186] It is to be understood that the crystalline kinases are not limited to naturally 
occurring or native kinase. Indeed, the crystals of the invention include crystals of mutants 
of native kinases. Mutants of native kinases are obtained by replacing at least one amino 
acid residue in a native kinase with a different amino acid residue, or by adding or deleting 
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amino acid residues within the native polypeptide or at the N- or C-terminus of the native 
polypeptide, and have substaatially the same three-dimensional structure as the native 
kinase from which the mutant is derived. 

(0187] By having substantially the same three-dimensional structure is meant having a set 
of atomic structure coordinates that have a root-mean-square deviation of less than or equal 
to about 2 A when superimposed with the atomic structure coordinates of the native kinase 
from which the mutant is derived when at least about 50% to 100% of the Ca atoms of the 
native kinase domain are included in the superposition. 

[0188] Amino acid substitutions, deletions and additions which do not significantly 
interfere with the three-dimensional structure of the kinase will depend, in part, on the 
region of the kinase where the substitution, addition or deletion occurs. In highly variable 
regions of the molecule, non-conservative substitutions as well as conservative substitutions 
may be tolerated without significantly dismpting the three-dimensional, structure of the 
molecule. In highly conser/ed regions, or regions containing si,gnificant secondary 
structure, conservative amiiK acid substitutions are preferred. Such conserved and variable 
regions can be identified by s quence alignment of a particular kinase (e.g., PIM-1) with 
other kinases). Such alignmem o/ some kinases is provided in Table 3. 

[0189] Conservative amino acid s\;.bsl\tutions are well known in the art, and include 
substitutions made on the basis of simi.'miy in polarity, charge, solubility, hydrophobicity, 
hydrophilicity and/or the amphipathic nature of the amino acid residues involved. For 
example, negatively charged amino acids include aspartic acid and glutamic acid; positively 
charged amino acids include lysine and arginine; amino acids with uncharged polar head 
groups having similar hydrophilicity values include the following: leucine, isoleucine, 
valine; glycine, alanine; asparagine, glutamine; serine, threonine; phenylalanine, tyrosine. 
Other conservative amino acid substitutions are well known in the art. 

[0190] For kinases obtained in whole or in part by chemical synthesis, the selection of 
amino acids available for substitution or addition is not limited to the genetically encoded 
amino acids. Indeed, the mutants described herein may contain non-genetically encoded 
amino acids. Conservative amino acid substitutions for many of the conmionly known non- 
genetically encoded amino acids are well known in the art. Conservative substitutions for 
other amino acids can be determined based on their physical properties as compared to the 
properties of the genetically encoded amino acids. 
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[0191] In some instances, it may be particularly advantageous or convenient to substitute, 
delete and/or add amino acid residues to a native kinase in order to provide convenient 
cloning sites in cDNA encoding the polypeptide, to aid in purification of the polypeptide, 
and for crystallization of the polypeptide. Such substitutions, deletions and/or additions 
which do not substantially alter the three dimensional structure of the native kinase domain 
will be apparent to those of ordinary skill in the art. 

[0192] It should be noted that the mutants contemplated herein need not all exhibit kinase 
activity. Indeed, amino acid substitutions, additions or deletions that int^ere with the 
kinase activity but which do not significantly alter the three-dimensional structure of the 
domain are specifically contemplated by the invention. Such crystalline polypeptides, or 
the atomic stmcture coordinates obtained therefi-om, can be used to identify compounds that 
bind to the native domain. These compounds can affect the activity of the native domain. 

[0193] The derivative crystals of the invention can comprise a crystalline kinase 
polypeptide in covalent association with one or more heavy metal atoms. The polypeptide 
may correspond to a native or a mutated kinase. Heavy metal atoms useful for providing 
derivative crystals include, by way of example and not limitation, gold, mercury, selenium, 
etc. 

[0194] The co-crystals of the invention generally comprise a crystalline kinase domain 
polypeptide in association with one or more compounds. The association may be covalent 
or non-covalent. Such compounds include, but are not limited to, cofactors, substrates, 
substrate analogues, inhibitors, allosteric effectors, etc. 

[0195] Exemplary mutations for PIM family kinases include the substitution or of the 
proline at the site corresponding to residue 123 in human PIM-1 . One useful substitution is 
a proline to methionine substitution at residue 123 (P123M). Such substitution is useful, for 
example, to assist in using PIM family kinases to model other kinases that do not have 
proline at that site. Additional exemplary mutations include substitution or deletion of one 
or more of PIM-1 residues 124-128 or a residue firom another PIM aligning with PIM-1 
residues 124-128. For example, a PIM residue aligning with PIM-1 residue 128 can be 
deleted. Mutations at other sites can likewise be carried out, eg. , to make a mutated PIM 
family kinase more similar to another kinase for structure modeling and/or compound fitting 
purposes. 
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m. Three Dimensional Structure Determination Using X-ray Crystallography 

[0196] X-ray crystallography is a method of solving the three dimensional structures of 
molecules. The structure of a molecule is calculated from X-ray diffraction patterns using a 
crystal as a diffraction grating. Three dimensional structures of protein molecules arise from 
crystals grown from a concentrated aqueous solution of that protein. The process of X-ray 
crystallography can include the following steps: 

(a) synthesizing and isolating (or otherwise obtaining) a polypeptide; 

(b) growing a crystal from an aqueous solution comprising the polypeptide with 
or without a modulator; and 

(c) collecting X-ray diffraction patterns from the crystals, determining unit cell 
dimensions and symmetry, determining electron density, fitting the amino 
acid sequence of the polypeptide to the electron density, and refining the 
stracture. 

Production of Polypeptides 

[0197] The native and mutated kinase polypeptides described herein may be chemically 
synthesized in whole or part using techniques that are well-known in the art (see, e.g^., 
Creighton (1983) Biopolymers 22(l):49-58). 

[0198] Altematively, methods which are well known to those skilled in the art can be used 
to construct expression vectors containing the native or mutated kinase polypeptide coding 
sequence and appropriate transcriptional/translational control signals. These methods 
include in vitro recombinant DNA techniques, synthetic techniques and in vivo 
recombination/genetic recombination. See, for example, the techniques described in 
Maniatis, T (1989). Molecular cloning: A laboratory Manual . Cold Spring Harbor 
Laboratory, New York. Cold Spring Harbor Laboratory Press; and Ansubel, F.M. et al. 
(1994) Current Protocols in Molecular Biology . John Wiley & Sons, Secaucus, N.J. 

[0199] A variety of host-expression vector systems may be utilized to express the kinase 
coding sequence. These include but are not limited to microorganisms such as bacteria 
transformed with recombinant bacteriophage DNA, plasmid DNA or cosmid DNA 
expression vectors containing the kinase domain coding sequence; yeast transformed with 
recombinant yeast expression vectors containing the kinase domain coding sequence; insect 
cell systems infected with recombinant virus expression vectors (e.g., baculovirus) 
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containing the kinase domain coding sequence; plant cell systems infected with recombinant 
virus expression vectors (e,g.^ cauliflower mosaic virus, CaMV; tobacco mosaic viras, 
TMV) or transformed with recombinant plasmid expression vectors (e,g., Ti plasmid) 
containing the kinase domain coding sequence; or animal cell systems. The expression 
elements of these systems vary in their strength and specificities. 

[0200] Depending on the host/vector system utilized, any of a number of suitable 
transcription and translation elements, including constitutive and inducible promoters, may 
be used in the expression vector. For example, when cloning in bacterial systems, inducible 
promoters such as pL of bacteriophage X, plac, ptrp, ptac (ptrp-lac hybrid promoter) and the 
like may be used; when cloning in insect cell systems, promoters such as the baculovirus 
polyhedrin promoter may be used; when cloning in plant cell systems, promoters derived 
from the genome of plant cells {e.g,, heat shock promoters; the promoter for the small 
subimit of RUBISCO; the promoter for the chlorophyll a/b binding protein) or from plant 
viruses (e-^., the 35S RNA promoter of CaMV; the coat protein promoter of TMV) may be 
used; when cloning in mammalian cell systems, promoters derived from the genome of 
mammalian cells metalloMonein promoter) or from mammalian viruses (e.g., the 
adenovirus late promoter; tiiie vaccinia virus 7.5K promoter) may be used; when generating 
cell lines that contain multiple copies of the kinase domain DNA, SV40-, BPV- and EBV- 
based vectors may be used with an appropriate selectable marker. 

[0201 1 Exemplary methods describing methods of DNA manipulation, vectors, various 
types of cells used, methods of incorporating the vectors into the cells, expression 
techniques, protein purification and isolation methods, and protein concentration methods 
are disclosed in detail in PCT publication WO 96/18738. This publication is incorporated 
herein by reference in its entirety, including any drawings. Those skilled in the art will 
appreciate that such descriptions are apphcable to the present invention and can be easily 
adapted to it. 

Crystal Growth 

[02021 Crystals are grown from an aqueous solution containing the purified and 
concentrated polypeptide by a variety of techniques. These techniques include batch, liquid, 
bridge, dialysis, vapor diffusion, and hanging drop methods. McPherson (1982) John Wiley, 
New York; McPherson (1990) Eur. J. Biochem. 189:1-23; Webber {\99\) Adv. Protein 
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Chem, 41 : 1-36, incorporated by reference herein in their entireties, including all figures, 
tables, and drawings. 

[02031 The native crystals of the mvention are, in general, grown by adding precipitants to 
the concentrated solution of the polypeptide. The precipitants are added at a concentration 
just below that necessary to precipitate the.protein. Water is removed by controlled 
evaporation to produce precipitating conditions, which are maintained until crystal growth 
ceases. 

[0204] For crystals of the invention, exemplary crystallization conditions are described in 
the Examples. Those of ordinary skill in the art will recognize that the exemplary 
crystallization conditions can be varied. Such variations may be used alone or in 
combination. In addition, other crystallizations may be found, e.g., by using crystallization 
screening plates to identify such other conditions. 

[0205] Derivative crystals of the invention can be obtained by soaking native crystals in 
mother liquor containing salts of heavy metal atoms. It has been found that soaking a native 
crystal in a solution containing about 0.1 mM to about 5 mM fhimerosal, 4- 
chloromeruribenzoic acid or KAu(CN)2 for about 2 hr to about 72 hr provides derivative 
crystals suitable for use as isomorphous replacements in determining the X-ray crystal 
structure of PIM-1. 

[0206] Co-crystals of the invention can be obtained by soaking a native crystal in mother 
liquor containing compound that binds the kinase, or can be obtained by co-crystallizing the 
kinase polypeptide in the presence of a binding compound. 

[0207] Generally, co-crystallization of kinase and binding compound can be accomplished 
using conditions identified for crystallizing the corresponding kinase without binding 
compound. It is advantageous if a plurality of different crystallization conditions have been 
identified for the kinase, and these can be tested to determine which condition gives the best 
co-crystals. It may also be beneficial to optimize the conditions for co-crystallization. 
Exemplary co-crystallization conditions are provided in the Examples. 

Determining Unit Cell Dimensions and the Three Dimensional Stmcture of a 
Polypeptide or Polypeptide Complex 
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[0208] Once the crystal is grown, it can be placed in a glass capillary tube or other 
mounting device and mounted onto a holding device connected to an X-ray generator and 
an X-ray detection device. Collection of X-ray dif&action patterns are well documented by 
those in the art. See, e.g,, Ducruix and Geige, (1992), IRL Press, Oxford, England, and 
references cited therein. A beam of X-rays enters the crystal and then diffracts from the 
crystal. An X-ray detection device can be utilized to record the diffraction patterns 
emanating from the crystal. Although the X-ray detection device on older models of these 
instruments is a piece of film, modem instruments digitally record X-ray diffraction 
scattering. X-ray sources can be of various types, but advantageously, a high intensity 
source is used, e.g., a synchrotron beam source. 

[02091 Methods for obtaining the three dimensional structure of the crystalline form of a 
peptide molecule or molecule complex are well known in the art. See, e.g., Ducmix and 
Geige, (1992), IRL Press, Oxford, England, and references cited therein. The following are 
steps in the process of deteniiining the three dimensional structure of a molecule or complex 
from X-ray diffraction data. 

[0210] After the X-ray diffraction patterns are collected from the crystal, the unit cell 
dimensions and orientation in the crystal can be determined. They can be determined from 
the spacing between the diffraction emissions as well as the patterns made from these 
emissions. The imit cell dimensions are characterized in three dimensions in units of 
Angstroms (one A= 10'^° meters) and by angles at each vertices. The symmetry of the unit 
cell in the crystals is also characterized at this stage. The symmetry of the unit cell in the 
crystal simplifies the complexity of the collected data by identifying repeating patterns. 
Application of the symmetry and dimensions of the unit cell is described below. 

[021 1] Each dif&action pattern emission is characterized as a vector and the data collected 
at this stage of the method determines the amplitude of each vector. The phases of the 
vectors can be determined using multiple techniques. In one method, heavy atoms can be 
soaked into a crystal, a method called isomorphous replacement, and the phases of the 
vectors can be determined by using these heavy atoms as reference points in the X-ray 
analysis. (Otwinowski, (1991), Daresbury, United Kingdom, 80-86). The isomorphous 
replacement method usually utilizes more than one heavy atom derivative. In another 
method, the amplitudes and phases of vectors from a crystalline polypeptide with an already 
determined stmcture can be applied to the amplitudes of the vectors from a crystalline 
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polypeptide of unknown structure and consequently determine the phases of these vectors. 
This second method is known as molecular replacement and the protein structure which is 
used as a reference must have a closely related structure to the protein of interest. (Naraza 
(1994) Proteins 1 1:281-296). Thus, the vector information from a kinase of known 
stmcture, such as those reported herein, are useful for the molecular replacement analysis of 
another kinase with unknown stmcture. 

(02121 Once the phases of the vectors describing the unit cell of a crysttai are determined, 
the vector amphtudes and phases, imit cell dimensions, and unit cell symmetry can be used 
as terms in a Fourier transform function. The Fourier transform fonction calculates the 
electron density in the unit cell from these measurements. The electron density that 
describes one of the molecules or one of the molecule complexes in the unit cell can be 
referred to as an electron density map. The amino acid structures of the sequence or the 
molecular structures of compounds complexed with the crystalline polypeptide may then be 
fitted to the electron density using a variety of computer programs. This step of the process 
is sometimes referred to as model building and can be accomplished by using computer 
programs such as Turbo/FRODO or "O^ {Jon&s (19^5) Methods in Enzymology 115:157- 
171). 

[0213] A theoretical electron density map can then be calculated from the amino acid 
structures fit to the experimentally determined electron density. The theoretical and 
experimental electron density maps can be compared to one another and the agreement 
between these two m^s can be described by a parameter called an R-factor. A low value 
for an R-factor describes a high degree of overlapping electron density between a theoretical 
and experimental electron density map. 

[0214] The R-factor is then minimized by using computer programs that refine the 
theoretical electron density map. A computer program such as X-PLOR can be used for 
model refinement by those skilled in the art. Briinger (1992) Nature 355:472-475. 
Refinement may be achieved in an iterative process. A first step can entail altering the 
conformation of atoms defined in an electron density map. The conformations of the atoms 
can be altered by simulating a rise in temperature, which will increase the vibrational 
frequency of the bonds and modify positions of atoms in the stmcture. At a particular point 
in the atomic perturbation process, a force field, which typically defines interactions 
between atoms in terms of allowed bond angles and bond lengths. Van der Waals 
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interactions, hydrogen bonds, ionic interactions, and hydrophobic interactions, can be 
applied to the system of atorDS. Favorable interactions naay be described in terms of free 
energy and the atoms can'hd moved over many iterations until a free energy murnnum is 
achieved. The refinement process can be iterated until the R-factor reaches a minimum 
value. 

[021 5] The three dimensional stmcture of the molecule or molecule complex is described 
by atoms that fit the theoretical electron density characterized by a minimimi R- value, A 
file can then be created for the three dimensional structure that defines each atom by 
coordinates in three dimensions. An example of such a stmctural coordinate file is shown 
in Table L 

IV. Structures of an Exemplary Kinase Domain : human PIM-1 

[0216] As an example of kinase structure, high-resolution three-dimensional stmctures 
and atomic structure coordinates of crystalline PIM-1 and PIM-1 co-complexed with 
exemplary binding compounds as determined by X-ray crystallography are provided. The 
specific methods used to obtain the stmcture coordinates are provided in the examples. The 
atomic structure coordinates of crystalline PIM-l are listed in Table 1, and atomic 
coordinates for PIM-1 co-crystallized with AMP-PMP are provided in Table 4. Co-crystal 
coordinates can be used in the same way, e,g,y in the various aspects described herein, as 
coordinates for the protein by itself. 

[0217] Those having skill in the art will recognize that atomic structure coordinates as 
determined by X-ray crystallography are not without error. Thus, it is to be understood that 
any set of structure coordinates obtained for crystals of PIM-1, whether native crystals, 
derivative crystals or co-crystals, that have a root mean square deviation ("r.m.s.d.") of less 
than or equal to about 1 .5 A when superimposed, using backbone atoms (N, Ca, C aad 0), 
on the stmcture coordinates listed in Table 1 (or Table 4) are considered to be identical with 
the structure coordinates listed in the Table 1 (or Table 4) when at least about 50% to 100% 
of the backbone atoms of PBM-l are included in the superposition. 

[0218) In addition to the PIM-1 structures provided herein, additional protein kinase 
structures are available and can be used, for example, publicly available structures deposited 
in the Protein Data Bank (PDB) (available for example, over the Literaet). Higher quality 
stmctures are preferred (e.g., at least 2.5, 2.2, 2.0, 1.8, 1.7, 1.6, 1.5, 1.4, 1.3, or L2A 
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resolution, as they provide more or more precise information for compound fitting and 
selection or design of derivatives. 

V. Uses of the Crystals and Atomic Structure Coordinates 

[0219] Kinase crystals, and particularly the atomic structure coordinates obtained 
therefrom, have a wide variety of uses. For example, the kinase crystals such as those 
described herein can be used as a starting point in any of the methods of use for kinases 
known in the art or later developed. Such methods of use include, for example, identifying 
molecules that bind to the native or mutated catalytic domain of kinases. The crystals and 
stmcture coordinates are particularly useful for identifying ligands that modulate kinase 
activity as an approach towards developing new therapeutic agents. In particular, the 
crystals and structural information are useful in methods for ligand development utilizing 
molecular scaffolds. 

[0220] The structure coordinates described herein can be used as phasing models for 
determining the crystal stmctures of additional kinases, as well as the structures of co- 
crystals of such kinases with ligands such as ixih bitors, agonists, antagonists, and other 
molecules. The structure coordinates, as well as n^o.dpls of the three-dimensional structures 
obtained therefrom, can also be used to aid the elucidati on of solution-based structures of 
native or mutated kinases, such as those obtained via 'NIAR . 

VI. Electronic Representations of Kinase Structures 

[0221] Structural information of kinases or portions of kinases (e.g., kinase active sites) 
can be represented in many different ways. Particularly useful are electronic 
representations, as such representations allow rapid and convenient data manipulations and 
structural modifications. Electronic representations can be embedded in many different 
storage or memory media, frequently computer readable media. Examples include without 
limitations, computer random access memory (RAM), floppy disk, magnetic hard drive, 
magnetic tape (analog or digital), compact disk (CD), optical disk, CD-ROM, memory card, 
digital video disk (DVD), and others. The storage medium can be separate or part of a 
computer system. Such a computer system may be a dedicated, special purpose, or 
embedded system, such as a computer system that forms part of an X-ray crystallography 
system, or may be a general purpose computer (which may have data connection with other 
equipment such as a sensor device in an X-ray crystallographic system. In many cases, the 
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infoTmation provided by such electronic representations can also be represented physically 
or visually in two or toee dimensions, e.g., on paper, as a visual display (e,g„ on a 
computer monitor as a two dimensional or pseudo-three dimensional image) or as a three 
dimensional physical model. Such physical representations can also be used, alone or in 
cormection with electronic representations. Exemplary useful representations include, but 
are not limited tO;, the following: 

Atomic Coordinate Representation 

[0222] One type of representation is a list or table of atonuc coordinates representing 
positions of particular atoms in a molecular structure, portions of a stmcture, or complex 
(e,g., a co-crystal). Such a representation may also include adriitional information, for 
example, information about occupancy of particular coordinates. 

Energy Surface or Surface of hiteraction Representation 

[0223] Another representation is an energy surface representation, e.g., of an active site or 
ottier binding site, representing an energy surface for electronic and steric interactions. 
Such a representation may also include other features. An example is the inclusion of 
representation of a particular amino acid residue(s) or group(s) on a particular amino acid 
residue(s), e.g., a residue or group that can participate in H-bonding or ionic interaction. 

Structural Representation 

[0224] Still another representation is a stmctural representation, i. e , a physical 
representation or an electronic representation of such a physical representation. Such a 
stractural representation includes representations of relative positions of particular features 
of a molecule or complex, often with linkage between stractural features. For example, a 
structure can be represented in which all atoms are linked; atoms other than hydrogen are 
linked; backbone atoms, with or without representation of side chain atoms tiiat could 
participate in significant electronic interaction, are linked; among others. However, not all 
features need to be linked. For example, for structural representations of portions of a 
molecule or complex, structural features significant for that feature may be represented 
(e.g., atoms of amino acid residues that can have significant binding interaction with a 
ligand at a binding site. Those amino acid residues may not be linked with each other. 

(0225] A structural representation can also be a schematic representation. For example, a 
schematic representation can represent secondary and/or tertiary structure in a schematic 
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manner. Within such a schematic representation of a polypeptide, a particular amino acid 
residue(s) or group(s) on ajpsidue(s) can be included, e,g.^ conserved residues in % binding 
site, and/or residue(s) or^bup(s) that may interact with binding compoimds. 

Vn. structure Determination for Kinases with Unknown Structure Using Structural 
Coordinates 

[0226] Stmctural coordinates, such as those set forth in Table 1, can be used to determine 
the three dimensional stractures of kinases with unknown stracture. The methods described 
below can apply stmctural coordinates of a polypeptide with known stmcture to another 
data set, such as an amino acid sequence. X-ray crystallographic diffraction data, or nuclear 
magnetic resonance (TslMR) data. Preferred embodiments of the invention relate to 
determining the three dimensional structures of other PIM kinases, other serine/threonine 
kinases, and related polypeptides. 

Structures Using Amino Acid Homology 

[0227] Homology modeling is a method of applying stmctural coordinates of a 
polypeptide of known stracture to the amino acid sequence of a polypeptide of unknown 
stmcture. This method is accomplished using a computer representation of the three 
dimensional stracture of a polypeptide or polypeptide complex, the computer representation 
of amino acid sequences of the polypeptides with known and imknown stractures, and 
standard computer representations of the stractures of amino acids. Homology modeling 
generally involves (a) aligning the amino acid sequences of the polypeptides with and 
without known stracture; (b) transferring the coordinates of Ihe conserved amino acids in 
the known stracture to the corresponding amino acids of the polypeptide of unknown 
stracture; refining the subsequent three dimensional stracture; and (d) constructing 
stractures of the rest of the polypeptide. One skilled in the art recognizes that conserved 
amino acids between two proteins can be determined from the sequence alignment step in 
step (a). 

[0228] The above method is well known to those skilled in the art. (Greer (1985) Science 
228:1055; Blundell et al A(1988) Eur, J. BiochemL 172:513. An exemplary computer 
program that can be utilized for homology modeling by those skilled in the art is the 
Homology module in the hasight 11 modeling package distributed by Accelerys Inc. 

[0229] Alignment of the amino acid sequence is accomplished by first placing the 
computer representation of the amino acid sequence of a polypeptide with known stracture 
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above the amino acid sequence of the polypeptide of unknown structure. Amino acids in the 
sequences are then compared and groups of amino acids that are homologous (e.g., amino 
acid side chains that are similar in chemical nature - aliphatic, aromatic, polar, or charged) 
are grouped together. This method will detect conserved regions of the polypeptides and 
account for amino acid insertions or deletions. 

[0230] Once the amino acid sequences of the polypeptides with known and unknown 
structures are aligned, the structures of the conserved amino acids in the computer 
representation of the polypeptide with knoAvn structure are transferred to the corresponding 
amino acids of the polypeptide whose structure is unknown. For example, a tyrosine in the 
amino acid sequence of known stmcture may be replaced by a phenylalanine, the 
corresponding homologous amino acid in the amino acid sequence of unknown stmcture. 

[0231] The stmctures of amino acids located in non-conserved regions are to be assigned 
manually by either using standard peptide geometries or molecular simulation techniques, 
such as molecular dynamics. The final step in the process is accomplished by refining the 
entire structure using molecular dynamics and/or energ ' minimization. The homology 
modeling method is well known to those skilled in the i i and has been practiced using 
different protein molecules. For example, the three dinieiisl»>naj. jstnicture of the polypeptide 
corresponding to the catalytic domain of a serine/threonine piot^ntr kinase, myosin light 
chain protein kinase, was homology modeled jfrom the cAMP-dep^^ndent protein kinase 
catalytic subunit, (Knighton et aL (1992) Science 258:130-135.) 

Structures Using Molecular Replacement 

[0232] Molecular replacement is a method of applying the X-ray diffraction data of a 
polypeptide of known structure to the X-ray diffraction data of a polypeptide of unknown 
sequence. This method can be utilized to define the phases describing the X-ray diffraction 
data of a polypeptide of unknown structure when only the amplitudes are known. X-PLOR 
is a commonly utilized computer software package used for molecular replacement. 
Brunger (1992) Nature 355:472-475. AMORE is another program used for molecular 
replacement. Navaza (1994) Acta Crystallogr, A50:157-163. Preferably, the resulting 
structure does not exhibit a root-mean-square deviation of more than 3A. 

[0233] A goal of molecular replacement is to align the positions of atoms in the unit cell 
by matching electron diffraction data from two crystals. A program such as X-PLOR can 
involve four steps. A first step can be to determine the number of molecules in the unit cell 

SUBSTITUTE SHEET (RULE 26) 



wo 2005/028624 



55 



PCT/US2004/030360 



and define the angles between them. A second step can involve rotating the diflfraction data 
to define the orientation of the molecules in the unit cell. A third step can be to translate the 
electron density in three dimensions to correctly position the molecules in the unit cell. 
Once the amplitudes and phases of the X-ray diffraction data is detemiined, an R-factor can 
be calculated by comparing electron diffi-action maps calculated experimentally firom the 
reference data set and calculated from the new data set. An R-factor between 30-50% 
indicates that the orientations of the atoms in the unit cell are reasonably determined by this 
method. A fourth step in the process can be to decrease tihe R-factor to roughly 20% by 
refining the new electron density map using iterative refinement techniques described herein 
and known to tihiose or ordinary skill in the art. 

Structures Using NMR Data 
[0234] Structural coordinates of a polypeptide or polypeptide complex derived fi^om X-ray 
crystallographic techniques can be applied towards the elucidation of three dimensional 
structiu-es of polypeptides from nuclear magnetic resonance (NMR) data. This method is 
used by those skilled in the art. (Wuthrich, (1986), John Wiley and Sons, New York:176- 
1 99; Pflugrath et al (1986) J. Mol Biol 1 89:383-386; Kline et al (1986) J. Mol Biol 
189:377-382). While the secondary structure of a polypeptide is often readily determined by 
utilizing two-dimensional NMR data, the spatial coxmections between individual pieces of 
secondary structure are not as readily determinable. The coordinates defining a three- 
dimensional structure of a polypeptide derived from X-ray crystallographic techniques can 
guide the NMR spectroscopist to an understanding of these spatial interactions between 
secondary structural elements in a polypeptide of related stmcture, 

[0235] The knowledge of spatial interactions between secondary structural elements can 
greatly simplify Nuclear Overhauser Effect (NOE) data from two-dimensional NMR 
experiments. Additionally, applying the crystallographic coordinates after the determination 
of secondary structure by NMR techniques only simplifies the assignment of NOEs relating 
to particular amino acids in the polypeptide sequence and does not greatly bias the NMR 
analysis of polypeptide structure. Conversely, using the crystallographic coordinates to 
simplify NOE data while determining secondary structure of the polypeptide would bias the 
NMR analysis of protein structure. 

Vm. Structure-Based Design of Modulators of Kinase Function Utilizing Structural 
Coordinates 
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[0236] Structure-based modulator design and identification methods are powerful 
techniques that can involve searches of computer databases containing a wide variety of 
potential modulators and chemical functional groups. The computerized design and 
identification of modulators is useful as the computer databases contain more compounds 
than the chemical libraries, often by an order of magnitude. For reviews of structure-based 
dmg design and identification (see Kuntz et al (1994), Acc. Chem. Res. 27:1 17; Guida 
(1994) Current Opinion in Struc, BioL 4: 777; Colman (1994) Current Opinion in Struc. 
Biol 4: 868). 

[0237] The three dimensional structure of a polypeptide defined by structural coordinates 
can be utiUzed by these design methods, for example, the structural coordinates of Table 1. 
In addition, the three dimensional structures of kinases determined by the homology, 
molecular replacement, and NMR techniques described herein can also be applied to 
modulator design and identification methods. 

[0238] For identifying modulators, structural information for a native kinase, in particular, 
stmctural information for the active site of the kinase, can be used. However, it may be 
advantageous to utihze stmctural information from one or more co-crystals of the kinase 
with one or more binding compounds. It can also be advantageous if the binding compound 
has a structural core in common with test compoimds. 

Design by Searching Molecular Data Bases 

[0239] One method of rational design searches for modulators by docking the computer 
representations of compounds from a database of molecules. Publicly available databases 
include, for example: 



a) 


ACD from Molecular Designs Limited 


b) 


NCI from National Cancer Institute 


c) 


CCDC from Cambridge Crystallographic Data Center 


d) 


CAST from Chemical Abstract Service 


e) 


Derwent from Derwent Information Limited 


f) 


Maybridge from Maybridge Chemical Company LTD 


g) 


Aldrich from Aldrich Chemical Company 


^) 


Directory of Natural Products from Chapman & Hall 



[0240] One such data base (ACD distributed by Molecular Designs Limited Information 
Systems) contains compounds that are synthetically derived or are natural products. 
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Methods available to those skilled in the art can convert a data set represented in two 
dimensions to one represented in three dimensions. These methods are enabled by such 
computer programs as CONCORD from Tripos Associates or DE-Converter from 
Molecular Simulations Limited. 

[0241] Multiple methods of stractxire-based modulator design are known to fliose in the 
art. (Kuntz et aL (1982), J. Mol Biol 162: 269; Kuntz et al., (1994), Acc. Chem, Res. 27: 
111; Meng et al, (1992), J, Compt. Chem, 13: 505; Bohm, (1994), J. Comp, Aided Molec. 
Design 8: 623). 

[0242] A computer program widely utilized by those skilled in the art of rational 
modulator design is DOCK from the University of California in San Francisco, The general 
methods utilized by this computer program and programs like it are described in three 
applications below. More detailed inforaiation regarding some of these techniques can be 
found in the Accelerys User Guide, 1995. A typical computer program used for this 
purpose can comprise the following steps: 

(a) remove the existing compound from the protein; 

(b) dock the structure of another compound into the active-site using the computer 

program (such as DOCK) or by interactively moving the compound into the 
active-site; 

(c) characterize the space between the compound and the active-site atoms; 

(d) search libraries for molecular fragments which (i) can fit into the empty space 

between the compound and the active-site, and (ii) can be liiiked to the 
compound; and 

(e) link the fragments found above to the compound and evaluate the new modified 

compound. 

[0243] Part (c) refers to characterizing the geometry and the complementary interactions 
formed between the atoms of the active site and the compounds. A favorable geometric fit is 
attained when a significant surface area is shared between the compound and active-site 
atoms without fomiing unfavorable steric interactions. One skilled in the art would note 
that the method can be performed by skipping parts (d) and (e) and screening a database of 
many compounds. 

[0244] . Stmcture-based design and identification of modulators of kinase function can be 
used in conjunction with assay screening. As large computer databases of compounds 
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(around 10,000 compounds) can be searched in a matter of hours, the computer-based 
method can narrow the compounds tested as potential modulators of kinase function in 
biochemical or cellular assays. 

[0245] The above descriptions of structure-based modulator design are not all 
encompassing and other methods are rqjorted in the literature: 

(1) CAVEAT: Bartlett et al.,(1989), in Chemical and Biological Problems in 

Molecular Recognition, Roberts, S.M.; Ley, S.V.; Campbell, M.M, eds.; Royal 
Society of Chemistry: Cambridge, ppl82-196. 

(2) FLOG: Miller et aL, (1994), X Camp. Aided Molec, Design 8:153. 

(3) PRO Modulator: Clark et aL, (1995), J. Camp. Aided Molec, Design 9:13. 

(4) MCSS: Miranker and Karplus, (1991), Proteins: Structure, Function, and 

Genetics 11:29, 

(5) AUI ODOCK: Goodsell and Olson, (1990), Proteins: Structure, Function, and 

Genetics 8:195. 

(6) (5RID: Goodford, (1985), J. Med Chem. 28:849. 

Design by Modifying Compounds in Complex with a Kinase 

[0246] Another way of identifying compounds as potential modulators is to modify an 
existing modulator in the polypeptide active site. For example, the computer representation 
of modulators can be modified within the computer represesatation of a PIM-1 or other PIM. 
kinase active site. Detailed instractions for this technique can be found in the Accelerys 
User Manual, 1995 in LUDI. The computer representation of the modulator is typically 
modified by the deletion of a chemical group or groups or by the addition of a chemical 
group or groups, 

[0247] Upon each modification to the compound, the atoms of die modified compound 
and active site can be shifted in conformation and the distance between the modulator and 
the active-site atoms may be scored along with any complementary interactions formed 
between the two molecules. Scoring can be complete when a favorable geometric fit and 
favorable complementary interactions are attained. Compounds that have favorable scores 
are potential modulators. 
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Design by Modifying the Structure of Compounds that Bind a Kinase 
[0248J A third method of structure-based modulator design is to screen compoimds 
designed by a modulator building or modulator searching computer program. Examples of 
these types of programs can be found in the Molecular Simulations Package, Catalyst. 
Descriptions for using this program are documented in the Molecular Simulations User 
Guide (1995). Other computer programs used in this application are ISIS/HOST, 
ISIS/BASE, ISIS/DRAW) from Molecular Designs Limited and UNITY from Tripos 
Associates. 

[0249] These programs can be operated on the structure of a compoimd that has been 
removed from the active site of the three dimensional structure of a compound-kinase 
complex. Operating the program on such a compound is preferable since it is in a 
biologically active conformation. 

[0250] A modulator construction computer program is a computer program that may be 
used to replace computer representations of chemical groups in a coripound complexed 
with a kinase or other biomolecule with groups from a computer dati lase. A modulator 
searching computer program is a computer program that may be used .o sar ch computer 
representations of compounds from a computer data base that have similar tJi. -ee 
dimensional structures and similar chemical groups as compoimd bound to i\ pi j ticular 
biomolecule, 

[02511 A typical program can operate by using the following general steps: 

(a) map the compounds by chemical features such as by hydrogen bond donors or 

acceptors, hydrophobic/lipophilic sites, positively ionizable sites, or negatively 
ionizable sites; 

(b) add geometric constraints to the mapped features; and 

(c) search databases with the model generated in (b). 

[0252] Those skilled in the art also recognize that not all of the possible chemical features 
of the compound need be present in the model of (b). One can use any subset of the model 
to generate different models for data base searches. 
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Modulator Design Using Molecular Scaffolds 

"^^©^ 

[0253] The present invention can also advantageously utilize methods for designing 
compounds, designated as molecular scaffolds, that can act broadly across families of 
molecules and for using the molecular scaffold to design ligands that target individual or 
multiple members of those families. In preferred embodiments, the molecules can be 
proteins and a set of chemical compounds can be assembled that have properties such that 
they are 1) chemically designed to act on certain protein families and/or 2) behave more like 
molecular scaffolds, meaning that they have chemical substructures that make them specific 
for binding to one or more proteins in a family of interest Altematively, molecular 
scaffolds can be designed that are preferentially active on an individual target molecule. 

[0254] Usefiil chemical properties of molecular scaffolds can include one or more of the 
following characteristics, but are not limited thereto: an average molecular weight below 
about 350 daltons, or between from about 150 to about 350 daltons, or from about 150 to 
about 300 daltons; having a clogP below 3; a number of rotatable bonds of less than 4; a 
number of hydrogen bond donors and acceptors below 5 or below 4; a polar surface area of 
less than 50 A^; binding at protein binding sites in an orientation so that chemical 
substituents from a combinatorial library that are attached to the scaffold can be projected 
into pockets in the protein binding site; and possessing chemically tractable stractures at its 
substituent attachment points that can be modified, thereby enabhng rapid library 
construction. 

[0255] By "clog P" is meant the calculated log P of a compound, "P" referring to the 
partition coefficient between octanol and water. 

[0256] The term "Molecular Polar Surface Area (PSA)" refers to the sxmi of surface 
contributions of polar atoms (usually oxygens, nitrogens and attached hydrogens) in a 
molecule. The polar smface area has been shown to correlate well with drag transport 
properties, such as intestinal absorption, or blood-brain barrier penetration. 

[0257] Additional useful chemical properties of distinct compounds for inclusion in a 
combinatorial library include the ability to attach chemical moieties to the compound that 
will not interfere with binding of the compound to at least one protein of interest, and that 
will impart desirable properties to the library members, for example, causing the library 
members to be actively transported to cells and/or organs of interest, or the ability to attach 
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to a device such as a chromatography column (e.g., a streptavidin columLti through a 
molecule such as biotin) for uses such as tissue and proteomics profiling purposes. 

[02581 A person of ordinary skill in the art will realize other properties that can be 
desirable for the scaffold or library members to have depending on the particular 
requirements of the use, and that compounds with these properties can also be sought and 
identified in like manner. Methods of selecting compounds for assay are known to those of 
ordinary skill in the art, for example, methods and compoimds described in U.S. Patent No. 
6,288,234, 6,090,912, 5,840,485, each of which is hereby incorporated by reference in its 
entirety, including all charts and drawings. 

[0259] In various embodiments, the present invention provides methods of designing 
ligands that bind to a plurality of members of a molecular family, where the ligands contain 
a common molecular scaffold. Thus, a compound set can be assayed for binding to a 
plurality of members of a molecular family, e.g., a protein family. One or more compoimds 
that bind to a plurality of family members can be identified as molecular scaffolds. When 
the orientation of the scaffold at the binding site of the target molecules has been 
determined and chemically tractable structures have been identified, a set of ligands can be 
synthesized starting with one or a few molecular scaffolds to arrive at a plurality of ligands, 
wherein each ligand binds to a separate target molecule of the molecular family with altered 
or changed binding affinity or binding specificity relative to the scaffolds Thus, a plurality 
of drug lead molecules can be designed to preferentially target individual members of a 
molecular family based on the same molecular scaffold, and act on them in a specific 
manner. 

Binding Assays 

[0260] The methods of the present invention can involve assays that are able to detect the 
binding of compounds to a target molecule at a signal of at least about three times the 
standard deviation of the background signal, or at least about four times the standard 
deviation of the background signal. The assays of the present invention can also include 
assaying compounds for low affinity binding to the target molecule. A large variety of 
assays indicative of binding are knovm for different target types and can be used for this 
invention. Compounds that act broadly across protein families are not likely to have a high 
affinity against individual targets, due to the broad nature of their binding. Thus, assays 
described herein allow for the identification of compounds that bind with low affinity, very 
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low affinity, and extremely low affinity. Therefore, potency (or binding affinity) is not the 
primary, nor even the most important, indicia of identification of a potentially usefiil 
binding compound. Rather, even those compounds that bind with low affinity, very low 
affinity, or extremely low affinity can be considered as molecular scaffolds that can 
continue to the next phase of the ligand design process. 

[0261 ] By binding with "low aflSnity'' is meant binding to the target molecule with a 
dissociation constant (ka) of greater than 1 ixM under standard conditions. By binding with 
"very low affinity" is meant binding with a kd of above about 100 jiM xmder standard 
conditions. By binding with "extremely low affinity" is meant binding at a ka of above 
about 1 mM under standard conditions. By "moderate affinity" is meant binding with a kd 
of from about 200 nM to about 1 fiM under standard conditions. By "moderately high 
affinity*' is meant binding at a kd of from about 1 nM to about 200 nM. By binding at "high 
afiSnity" is meant binding at a kd of below about 1 nM under standard conditions. For 
example, low affinity binding can occur because of a poorer fit into the binding site of the 
target molecule or because of a smaller number of non-covalent bonds, or ^A'^eaker covalent 
bonds present to cause binding of the scaffold or ligand to the binding si te >f the target 
molecule relative to instances where higher aflSnity binding occurs. The staidard 
conditions for binding are at pH 7.2 at 37°C for one hour. For example, 100 u!/wc^! can be 
used in HEPES 50 mM buffer at pH 7.2, NaCl 15 mM, ATP 2 ^iM, and bovine smun:> 
albumin 1 ug/well, 37'^C for one hour. 

[0262] Binding compounds can also be characterized by their effect on the activity of the 
target molecule. Thus, a "low activity" compound has an inhibitory concentration (IC50) or 
excitation concentration (EC50) of greater than 1 piM under standard conditions. By "very 
low activity'' is meant an IC50 or EC50 of above 100 |aM under standard conditions. By 
"extremely low activity" is meant an IC50 or EC50 of above 1 mM under standard 
conditions. By "moderate activity'' is meant an IC50 or EC50 of 200 nM to 1 nM under 
standard conditions. By "moderately high activity'' is meant an IC50 or EC50 of 1 nM to 200 
nM. By "high activity" is meant an IC50 or EC50 of below 1 nM under standard conditions. 
The IC50 (or EC50) is defined as the concentration of compound at which 50% of the activity 
of the target molecule (e.g., enzyme or other protein) activity being measured is lost (or 
gained) relative to activity when no compound is present Activity can be measured using 
methods known to those of ordinary skill in the art, e.g., by measuring any detectable 
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product or signal produced by occurrence of an enzymatic reaction, or other activity by a 
protein being nieas^ured^vgagti. 



[0263] By "background signal'' in reference to a binding assay is meant the signal that is 
recorded under standard conditions for the particular assay in the absence of a test 
compound, molecular scaffold, or ligand that binds to the target molecule. Persons of 
ordinary skill in the art will realize that accepted methods exist and are widely available for 
detemiining background signal. 

[0264] By "standard deviation" is meant the square root of the variance. The variance is a 
measure of how spread out a distribution is. It is computed as the average squared deviation 
of each number from its mean. For example, for the numbers 1 , 2, and 3, the mean is 2 and 
the variance is: 



[0265] To design or discover scaffolds that act broadly across protein families, proteins of 
interest can be assayed against a compound collection or set. The assays can preferably be 
enzymatic or binding assays. In some embodiments it may be desirable to enhance the 
solubility of the compounds being screened and then analyze all compounds that show 
activity in the assay, including those that bind with low affinity or produce a signal with 
greater than about three times the standard deviation of the background signal. The assays 
can be any suitable assay such as, for example, binding assays that measure the binding 
affinity between two binding partners. Varioxis types of screening assays that can be useful 
in the practice of the present invention are known in the art, such as those described in U.S. 
Patent Nos. 5,763,198, 5,747,276, 5,877,007, 6,243,980, 6,294,330, and 6,294,330, each of 
which is hereby incorporated by reference in its entirety, including all charts and drawings. 

[0266] In various embodiments of the assays at least one compound, at least about 5%, at 
least about 10%, at least about 15%, at least about 20%, or at least about 25% of the 
compounds can bind with low affinity. In general, up to about 20% of the compounds can 
show activity in the screening assay and these compoimds can then be analyzed directly 
v^th high-throughput co-crystallography, computational analysis to group the compounds 
into classes with common structural properties (e.g., structural core and/or shape and 
polarity characteristics), and the identification of common chemical stractures between 
compounds that show activity. 




- a-2f + f2-2^^ + f3-2^^. - 0.667 
3 



SUBSTITUTE SHEET (RULE 26) 



wo 2005/028624 



64 



PCT/US2004/030360 



[0267] The person of ordinary skill in the art will realize that decisions can be based on 
criteria that are appropriate for the needs of the particular situation, and that the decisions 
can be made by computer software programs. Classes can be created containing almost any 
number of scaffolds, and the criteria selected can be based on increasingly exacting criteria 
until an arbitrary number of scaffolds is arrived at for each class that is deemed to be 
advantageous. 

Surface Plasmon Resonance 

[0268] Binding parameters can be measured using surface plasmon resonance, for 
example, with aBIAcore® chip (Biacore, Japan) coated with inamobilized binding 
components. Surface plasmon resonance is used to characterize the microscopic association 
and dissociation constants of reaction between an sFv or other ligand directed against target 
molecules. Such methods are generally described in the following references which are 
incorporated herein by reference. Vely F. et al., (2000) BIAcore® analysis to test 
phosphopeptide-SH2 domain interactions. Methods in Molecular Biology, 1 2 1 :3 1 3-2 1 ; 
Liparoto et al.^ (1999) Biosensor analysis of the interleukin-2 receptor complex. Journal of 
Molecular Recognition, 12:316-21; Lipschultz et al., (2000) Experimental design for 
analysis of complex kinetics using surface plasmon resonance. Methods. 20(3):310-8; 
Malmqvist., (1999) BIACORE: an affinity biosensor system for characterization of 
biomolecular interactions. Biochemical Society Transactions 27:335-40; Alfthan, (1998) 
Surface plasmon resonance biosensors as a tool in antibody engineering. Biosensors & 
Bioelectronics. 13:653-63; Fivash et aL, (1998) BIAcore for macromolecular interaction, 
Ciurent Opinion in Biotechnology. 9:97-101; Price et al.; (1998) Summary report on the 
ISOBM TD-4 Workshop: analysis of 56 monoclonal antibodies against the MUCl mucin. 
Tumour Biology 19 Suppl 1:1-20; Malmqvist et al, (1997) Biomolecular interaction 
analysis: affinity biosensor technologies for functional analysis of proteins. Current 
Opinion in Chemical Biology. 1:378-83; O'Shannessy et al., (1996) Interpretation of 
deviations from pseudo-first-order kinetic behavior in the characterization of Hgand binding 
by biosensor technology, Analytical Biochemistry. 236:275-83; Malmborg et aL, (1995) 
BIAcore as a tool in antibody engineering. Journal of Immunological Methods, 183:7-13; 
Van Regenmortel, (1994) Use of biosensors to characterize recombinant proteins. 
Developments in Biological Standardization. 83:143-51; and O'Shannessy, (1994) 
Determination of kinetic rate and equilibrium binding constants for macromolecular 
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interactions: a critique of the surface plasmon resonance literature, Current Opinions in 
Biotechnology. 5 :65-7 1 . 

[0269] BIAcore® uses the optical properties of surface plasmon resonance (SPR) to detect 
alterations in protein concentration bound to a dextran matrix lying on the surface of a 
gold/glass sensor chip interface, a dextran biosensor matrix. In brief, proteins are 
covalently bound to the dextran matrix at a known concentration and a ligand for the protein 
is injected through the dextran matrix. Near infrared hght, directed onto the opposite side of 
the sensor chip surface is reflected and also induces an evanescent wave in the gold film, 
which in turn, causes an intensity dip in the reflected light at a particular angle known as the 
resonance angle. If the refractive index of the sensor chip surface is altered (e.g., by ligand 
binding to the bound protein) a shift occurs in the resonance angle. This angle shift can be 
measured and is expressed as resonance units (RUs) such that 1000 RUs is equivalent to a 
change in smrface protein concentration of 1 ng/nmi^. These changes are displayed with 
respect to time along the y-axis of a sensorgram, which depicts the association and 
dissociation of any biological reaction. 

High Throughput Screening (HTS) Assays 

[0270] HTS typically uses automated assays to search through large numbers of 
compounds for a desired activity. Tj^ically HTS assays are used to find new drugs by 
screening for chemicals that act on a particular enzyme or molecule. For example, if a 
chemical inactivates an enzyme it might prove to be effective in preventing a process in a 
cell which causes a disease. High throughput methods enable researchers to assay 
thousands of different chemicals against each target molecule very quickly using robotic 
handling systems and automated analysis of results. 

10271] As used herein, "high throughput screening" or "HTS" refers to the rapid in vitro 
screening of large numbers of compounds (libraries); generally tens to hundreds of 
thousands of compounds, using robotic screening assays. Ultra high-throughput Screening 
(uHTS) generally refers to the high-throughput screening accelerated to greater than 
100,000 tests per day. 

10272] To achieve high-throughput screening, it is advantageous to house samples on a 
multicontainer carrier or platform. A multicontainer carrier facilitates measuring reactions 
of a plurality of candidate compounds simultaneously. Multi-well microplates may be used 
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as the carrier. Such multi-well microplates, and methods for their use in numerous assays, 
are both known in the sat aiid conunercially available. 

[0273] Screening assays may include controls for purposes of calibration and 
confirmation of proper manipulation of the components of the assay. Blank wells that 
contain all of the reactants but no member of the chemical library are usually included. As 
another example, a known inhibitor (or activator) of an enzyme for which modulators are 
sought, can be incubated with one sample of the assay, and the resulting decrease (or 
increase) in the enzyme activity used as a comparator or control. It will be appreciated that 
modulators can also be combined with the enzyme activators or inhibitors to find 
modulators which inhibit the enzyme activation or repression that is otherwise caused by the 
presence of the knovm the enzyme modulator. Similarly, when ligands to a sphingolipid 
target are sougjht, known ligands of the target can be present in control/calibration assay 
wells. 

Measuring Enzymatic and Binding Reactions During Screening Assays 

[0274] Techniques for measuring the progression of enzymatic and binding reactions, e.g., 
in multicontainer carriers, are known in the art and include, but are not limited to, the 
following. 

[0275] Spectrophotometric and spectrofluorometric assays are well known in the art. 
Examples of such assays include the use of colorimetric assays for the detection of 
peroxides, as disclosed in Example 1(b) and Gordon, A. J. and Ford, R. A., (1972) The 
Chemist's Companion: A Handbook Of Practical Data. Techniques, And References . John 
Wiley and Sons, N.Y., Page 437. 

[02761 Fluorescence spectrometry may be used to monitor the generation of reaction 
products. Fluorescence methodology is generally more sensitive than the absorption 
methodology. Theuseof fluorescent probes is well known to those skilled in the art. For 
reviews, see Bashford et al., (1987) Spectrophotometry and Snectrofluorometrv: A Practical 
Approach, pp. 91-1 14, IRL Press Ltd.; and Bell, (1981) Spectroscopy In Biochemistry. Vol. 
I, pp. 155-194, CRC Press. 

[02771 In spectrofluorometric methods, enzymes are exposed to substrates that change 
their intrinsic fluorescence when processed by the target enzyme. Typically, the substrate is 
nonfluorescent and is converted to a fluorophore through one or more reactions. As a non- 
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limiting example, SMase activity can be detected using the Amplex Red reagent 
(Molecular Probes, Eugene, OR). In order to measure sphingomyelinase activity using 
Amplex® Red, the following reactions occur. First, SMase hydrolyzes sphingomyelin to 
yield ceramide and phosphorylcholine. Second, alkaline phosphatase hydrolyzes 
phosphorylcholine to yield choline. Third, choline is oxidized by choline oxidase to 
betaine. Finally, H2O2, in the presence of horseradish peroxidase, reacts with Amplex® Red 
to produce the fluorescent product, Resorufin, and the signal therefrom is detected using 
spectrofluorometry. 

[0278J Fluorescence polarization (FP) is based on a decrease in the speed of molecular 
rotation of a fluorophore that occurs upon binding to a larger molecule, such as a receptor 
protein, allowing for polarized fluorescent emission by the bound ligand. FP is empirically 
determined by measuring the vertical and horizontal components of fluorophore emission 
following excitation with plane polarized light. Polarized emission is increased when the 
molecular rotation of a fluorophore is reduced. A fluorophore produces a larger polarized 
signal when it is bound to a larger molecule (i.e. a receptor), slowing molecular rotation of 
the fluorophore. The magnitude of the polarized signal relates quantitatively to the extent of 
fluorescent ligand binding. Accordingly, polarization of the "bound" signal, depends on 
maintenance of high affinity binding. 

[02791 FP is a homogeneous technology and reactions are very rapid, taking seconds to 
minutes to reach equilibrium. The reagents are stable, and large batches may be prepared, 
resulting in high reproducibility. Because of these properties, FP has proven to be highly 
automatable, often performed with a single incubation with a single, premixed, tracer- 
receptor reagent. For a review, see Owickiet al., (1997), Application of Fluorescence 
Polarization Assays in High-Throughput Screening, Genetic Engineering News, 17:27. 

[0280] FP is particularly desirable since its readout is independent of the emission 
intensity (Checovich, W. J., et al., (1995) Nature 375:254-256; Dandliker, W. B., et al., 
(1981) Methods in Enzymology 74:3-28) and is thus insensitive to the presence of colored 
compounds that quench fluorescence emission. FP and FRET (see below) are well-suited 
for identifying compounds that block interactions between sphingolipid receptors and their 
ligands. See, for example, Parker et al., (2000) Development of high throughput screening 
assays using fluorescence polarization: nuclear receptor-ligand-binding and 
kinase/phosphatase assays, JjBzomo/ Screen 5:77-88. 
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[0281] Fluorophores derived from sphingolipids that may be used in FP assays are 
commercially available. For example. Molecular Probes (Eugene, OR) currently sells 
sphingomyelin and one ceramide fluorophores. These are, respectively, N-(4,4-difluoro- 
5,7-dimethyl-4-bora-3a,4a~dia2:a-s-indacene- 3-pentanoyl)sphingosyl phosphocholine 
(BODIPY® FL C5-sphingomyelin); N-(4,4-difluoro-5,7-dimethyl-4-bora-3a,4a-diaza-s- 
indacene- 3-dodecanoyl)sphingosyl phosphocholine (BODIPY® FL C12-sphingomyelin); 
and N-(4,4-difluoro-5,7-dimethyl-4-bora-3a,4a-diaza-s-indacene- 3-pentanoyl)sphingosine 
(BODIPY® FL C5-ceramide). U.S. Patent No. 4,150,949, (Immunoassay for gentaraicin), 
discloses fluorescein-iabelled gentamicins, including fluoresceinthiocarbanyl gentamicin. 
Additional fluorophores may be prepared using methods well known to the skilled artisan. 

[0282] Exemplary nomial-and-polarized fluorescence readers include the POLARION® 
fluorescence polarization system (Tecan AG, Hombrechtikon, Switzerland). General 
multiwell plate readers for other assays are available, such as the VERSAMAX® reader and 
the SPECTRAMAX® multiwell plate spectrophotometer (both from Molecular Devices). 

[0283] Fluorescence resonance energy transfer (FRET) is another useful assay for 
detecting interaction and has been described. See, e.g., Heim et al., (1996) Cum Biol 
6:178-182; Mitra et al., (1996) Gene 173:13-17; and Selvin et al., (1995) Meth, Enzymol 
246:300-345. FRET detects the transfer of energy between two fluorescent substances in 
close proximity, having known excitation and emission wavelengths. As an example, a 
protein can be expressed as a fusion protein with green fluorescent protein (GFP). When 
two fluorescent proteins are in proximity, such as when a protein specifically interacts with 
a target molecule, the resonance energy can be transferred from one excited molecule to the 
other. As a result, the emission spectrum of the sample shifts, which can be measured by a 
fluorometer, such as a fMAX multiwell fluorometer (Molecular Devices, Sunnyvale Calif). 

[0284] Scintillation proximity assay (SPA) is a particularly useful assay for detecting an 
interaction with the target molecule. SPA is widely used in the pharmaceutical industry and 
has been described (Hanselman et aL, (1997) X Lipid Res. 38:2365-2373; Kahl et aL, (1996) 
Anal Biochem. 243:282-283; Undenfriend et al., (1987) Anal Biochem. 161:494-500). See 
also U.S. Patent Nos. 4,626,513 and 4,568,649, and European Patent No. 0,154,734. One 
commercially available system uses FLASHPLATE® scintillant-coated plates (NEN Life 
Science Products, Boston, MA). 
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[0285] The target molecule can be bound to the scintillator plates by a variety of well 
known means. Scintillant plates are available that are derivatized to bind to fusion proteins 
such as GST, His6 or Flag fusion proteins. Where the target molecule is a protein complex 
or a multimer, one protein or subunit can be attached to the plate first, then the other 
components of the complex added later under binding conditions, resulting in a bound 
complex. 

[0286] In a typical SPA assay, the gene products in the expression pool will have been 
radiolabeled and added to the wells, and allowed to interact with the solid phase, which is 
the immobilized target molecule and scintillant coating in the wells. The assay can be 
measured immediately or allowed to reach equilibrium. Either way, when a radiolabel 
becomes sufficiently close to the scintillant coating, it produces a signal detectable by a 
device such as a TOPCOUNT NXT® microplate scintillation counter (Packard Bioscience 
Co., Meriden Coim.)- If a radiolabeled expression product binds to the target molecule, the 
radiolabel remains in proximity to the schitillant long enough to produce a detectable signal. 

[021 7] In contrast, the labeled proteins that do not bind to the target molecule, or bind 
onl> briefly^ will not remain near the scintillant long enough to produce a signal above 
backgrov^.^d. Any time spent near the scintillant caused by random Brownian motion Avill 
also no t result n a significant amount of signal. Likewise, residual unincorporated 
radiolabel i sc4 during the expression step may be present, but will not generate significant 
signal because it ivill be in solution rather than interacting with the target molecule. These 
non-binding interactions will therefore cause a certain level of background signal that can 
be mathematically removed. If too many signals are obtained, salt or other modifiers can be 
added directly to the assay plates until the desired specificity is obtained (Nichols et aL, 
(1998) AnaL Biochem. 257:112-119). 

Assay Compounds and Molecular Scaffolds 

[0288] Preferred characteristics of a scaffold include being of low molecular weight (e.g., 
less than 350 Da, or from about 100 to about 350 daltons, or fi*om about 150 to about 300 
daltons). Preferably clog P of a scaffold is from -1 to 8, more preferably less than 6, 5, or 4, 
most preferably less than 3. In particular embodiments the clogP is in a range -1 to an 
upper limit of 2, 3, 4, 5, 6, or 8; or is in a range of 0 to an upper limit of 2,3, 4, 5, 6, or 8. 
Preferably the number of rotatable bonds is less than 5, more preferably less than 4. 
Preferably the number of hydrogen bond donors and acceptors is below 6, more preferably 



SUBSTITUTE SHEET (RULE 26) 



wo 2005/028624 



70 



PCT/US2004/030360 



below 5. An additional criterion that can be useful is a polar surface area of less than 5. 
Guidance that can be useful in identifying criteria for a particular appUcation can be found 
in Lipinski et aL, (1997) Advanced Drug Delivery Reviews 23 3-25, which is hereby 
incorporated by reference in its entirety. 

[0289] A scaffold may preferably bind to a given protein binding site in a configuration 
that causes substituent moieties of the scaffold to be situated in pockets of the protein 
binding site. Also, possessing chemically tractable groups that can be chemically modified, 
particularly through synthetic reactions, to easily create a combinatorial library can be a 
preferred characteristic of the scaffold. Also preferred can be having positions on the 
scaffold to which other moieties can be attached, which do not interfere with binding of the 
scaffold to the protein(s) of interest but do cause the scaffold to achieve a desirable 
property, for example, active transport of the scaffold to cells and/br organs, enabling the 
scaffold to be attached to a chromatographic column to facilitate analysis, or another 
desirable property. A molecular scaffold can bind to a target molecule with any affinity, 
such as binding with an affinity measurable as about three times the standard deviation of 
the background signal, or at high affinity, moderate affinity, low affinity, very low affinity, 
or extremely low affinity. 

[0290] Thus, the above criteria can be utilized to select many compounds for testing that 
have the desired attributes. Many compounds having the criteria described are available in 
the commercial market, and may be selected for assaying depending on the specific needs to 
which the methods are to be applied. 

[0291] A "compound library" or "library" is a collection of different compoimds having 
different chemical structures. A compound library is screenable, that is, the compound 
library members therein may be subject to screening assays. In preferred embodiments, the 
library members can have a molecular weight of firom about 100 to about 350 daltons, or 
from about 150 to about 350 daltons. Examples of libraries are provided above. 

[0292] Libraries of the present invention can contain at least one compound than binds to 
the target molecule at low affinity. Libraries of candidate compounds can be assayed by 
many different assays, such as those described above, e.g., a fluorescence polarization 
assay. Libraries may consist of chemically synthesized peptides, peptidomimetics, or arrays 
of combinatorial chemicals that are large or small, focused or nonfocused. By 'focused" it 
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is meant that the collection of compounds is prepared using the structure of previously 
characterized compounds and/or pharmacophores. 

[0293] Compound libraries may contain molecules isolated from natural sources, 
artificially synthesized molecules, or molecules synthesized, isolated, or otherwise prepared 
in such a manner so as to have one or more moieties variable, e,g., moieties that are 
independently isolated or randomly synthesized. Types of molecules in compound libraries 
include but are not limited to organic compoimds, polypeptides and nucleic acids as those 
terms are used herein, and derivatives, conjugates and mixtures thereof. 

[0294] Compound libraries of the invention may be purchased on the contunercial market 
or prepared or obtained by any means including, but not limited to, combinatorial chemistry 
techniques, fermentation methods, plant and cellular extraction procedures and the Hke (see, 
e.g., Cwirla et al., (1990) Biochemistry, 87, 6378-6382; Houghten et al., (1991) Nature, 354, 
84-86; Lam et al., (1991) Nature, 354, 82-84; Brenner et al., (1992) Proc, Natl Acad, Scl 
USA, 89, 5381-5383; R. A. Houghten, (1993) Trends Genet, 9, 235-239; E. R. Felder, 
(1994) Chimia, 48, 512-541; Gallop et al., (1994) J. Med. Chem., 37, 1233-1251; Gordon et 
al., (1994) J, Med. Chem., 37, 1385-1401; Carell et aL, (1995) Chem, Biol, 3, 171-183; 
Madden et al.. Perspectives in Drug Discovery and Design 2, 269-282; Lebl et al., (1995) 
Biopolymers, 37 177-198); small molecules assembled around a shared molecular structure; 
collections of chemicals that have been assembled by various commercial and 
noncommercial groups, natural products; extracts of marine organisms, fimgi, bacteria, and 
plants. 

[0295] Preferred libraries can be prepared in a homogenous reaction mixture, and 
separation of unreacted reagents from members of the library is not required prior to 
screening. Although many combinatorial chemistry approaches are based on solid state 
chemistry, liquid phase combinatorial chemistry is capable of generating libraries (Sun 
CM., (1999) Recent advances in liquid-phase combinatorial chemistry. Combinatorial 
Chemistry & High Throughput Screening. 2:299-318). 

[0296] Libraries of a variety of types of molecules are prepared in order to obtain 
members therefrom having one or more preselected attributes that can be prepared by a 
variety of techniques, including but not limited to parallel array synthesis (Houghton, 
(2000) Annu Rev Pharmacol Toxicol 40:273-82, Parallel array and mixture-based synthetic 
combinatorial chemistry; solution-phase combinatorial chemistry (Merritt, (1998) Comb 
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Chem High Throughput Screen l(2):57-72, Solution phase combinatorial chemistry, Coe et 
aL, (1998-99) Mol Divers; 4(l):31-8, Solution-phase combinatorial chemistry. Sun, (1999) 
Comb Chem High Throughput Screen 2(6):299-3 1 8, Recent advances in liquid-phase 
combinatorial chemistry); synthesis on soluble polymer (Gravert et al., (1997) Curr Opin 
Chem Biol 1(1): 107-13, Synthesis on soluble polymers: new reactions and the construction 
of small molecules); and the like. See, e.g., DoUe et aL, (1999) J Comb Chem l(4):235-82. 
Comprehensive survey of combinatorial library synthesis: 1998. Freidinger RM., (1999) 
Nonpeptidic ligands for peptide and protein receptors. Current Opinion in Chemical 
Biology; and Kundu et al.. Prog Drug Res;53:S9-156^ Combinatorial chemistry: polymer 
supported synthesis of peptide and non-peptide libraries). Compounds may be clinically 
tagged for ease of identification (Chabala, (1995) Curr Opin Biotechnol 6(6):633-9, Solid- 
phase combinatorial chemistry and novel tagging methods for identifying leads). 

[0297] ilie combinatorial synthesis of carbohydrates and libraries containing 
oUgosaccharides have been described (Schweizer et al., (1999) Curr Opin Chem Biol 
3(3):291-?5 Combinatorial synthesis of carbohydrates). The synthesis of natural-product 
based con -joimd libraries has been described (Wessjohann, (2000) Curr Opin Chem Biol 
4(3):303-9, Synthesis of natural-product based compound libraries). 

[0298] Librm v^s of nucleic acids are prepared by various techniques, including by way of 
non-limiting exarTtpl ^ the ones described herein, for the isolation of aptamers. Libraries that 
include oligonucleot.d£^5 and polyaminooligonucleotides (Markiewicz et al., (2000) 
Synthetic oligonucleotide combinatorial libraries and their applications, Farmaco. 55:174- 
7) displayed on streptavidin magnetic beads are known. Nucleic acid libraries are known 
that can be coupled to parallel sampling and be deconvoluted without complex procedures 
such as automated mass spectrometry (Enjalbal C. Martinez J. Aubagnac JL, (2000) Mass 
spectrometry in combinatorial chemistry. Mass Spectrometry Reviews, 19:139-61) and 
parallel tagging. (Perrin DM., Nucleic acids for recognition and catalysis: landmarks, 
limitations, and looking to the future. Combinatorial Chemistry & High Throughput 
Screening 3:243-69). 

[0299] Peptidomimetics are identified using combinatorial chemistry and solid phase 
synthesis (Kim HO. Kahn M., (2000) A merger of rational drug design and combinatorial 
chemistry: development and application of peptide secondary structure mimetics. 
Combinatorial Chemistry & High Throughput Screening 3:167-83; al-Obeidi, (1998) Mol 
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Biotechnol 9(3):205-23, Peptide and peptidomimetic libraries. Molecular diversity and drug 
design). The synthesis.»^|^?4.entirely random or based in part on a known polypeptide. 



10300] Polypeptide libraries can be prepared according to various techniques. In brief, 
phage display techniques can be used to produce polypeptide hgands (Gram H., (1999) 
Phage display in proteolysis and signal transduction. Combinatorial Chemistry & High 
Throughput Screening. 2:19-28) that may be used as the basis for synthesis of 
peptidomimetics. Polypeptides, constrained peptides, proteins, protein domains, antibodies, 
single chain antibody fragments, antibody fragments, and antibody combining regions are 
displayed on filamentous phage for selection. 

[0301] Large libraries of individual variants of human single chain Fv antibodies have 
been produced. See, e.g., Siegel RW. Allen B. Pavlik P. Marks ID. Bradbury A., (2000) 
Mass spectral analysis of a protein complex using single-chain antibodies selected on a 
peptide target: apphcations to fimctional genomics. Journal of Molecular Biology 302:285- 
93; Poul MA. Becerril B. Nielsen UB. Morisson P. Marks JD.,(2000) Selection of tumor- 
specific intemalizing human antibodies from phage hbraries. Source Journal of Molecular 
Biology. 301:1 149-61; Amersdorfer P. Marks JD., (2001) Phage libraries for generation of 
anti-botulinum scFv antibodies. Methods in Molecular Biology, 145:219-40; Hughes- Jones 
NC. Bye JM. Gorick BD. Marks JD. Ouwehand WH., (1999) Synthesis of Rh Fv phage- 
antibodies using VH and VL geimline genes, British Journal ofHaematology, 105:81 1-6; 
McCall AM. Amoroso AR. Sautes C. Marks JD. Weiner LM., (1998) Characterization of 
anti-mouse Fc gamma RH single-chain Fv fragments derived from human phage display 
libraries, Immunotechnology. 4:71-87; Sheets MD. Amersdorfer P. Finnem R. Sargent P. 
Lindquist E. Schier R. Hemingsen G. Wong C. Gerhart JC. Marks JD. Lindquist E., (1998) 
Efficient constmction of a large nonimmune phage antibody library: the production of high- 
affmity human single-chain antibodies to protein antigens (pubHshed erratum appears in 
Proc Natl Acad Sci US A 1999 96:795), Proc Natl Acad Sci USA 95:6157-62). 

[03021 Focused or smart chemical and pharmacophore libraries can be designed with the 
help of sophisticated strategies involving computational chemistry (e.g., Kundu B. Khare 
SK. Rastogi SK., (1999) Combinatorial chemistry: polymer supported synthesis of peptide 
and non-peptide libraries. Progress in Drug Research 53:89-156) and the use of structure- 
based ligands using database searching and docking, de novo drug design and estimation of 
ligand binding affinities (Joseph-McCarthy D., (1999) Computational approaches to 
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structure-based ligand design, Pharmacology & Therapeutics 84:179-91; Kirkpatrick DL. 
Watson S. Ulhaq S., (1999) Structure-based drag design: combinatorial chemistry and 
molecular modeling. Combinatorial Chemistry & High Throughput Screening. 2:21 1-21; 
Eliseev AV. Lehn JM., (1999) Dynamic combinatorial chemistry: evolutionary formation 
and screening of molecular libraries. Current Topics in Microbiology cS: Immunology 
243:159-72; Bolger et al., (1991) Methods Enz. 203:21-45; Martin, (1991) Methods Enz. 
203:587-613; Neidle et aL, (1991) Methods Enz. 203:433-458; U.S. Patent 6,178,384). 

Crystallography 

[0303] After binding compounds have been determined, the orientation of compound 
boimd to target is determined. Preferably this determination involves crystallography on co- 
crystals of molecular scaffold compounds with target. Most protein crystallographic 
platforms can preferably be designed to analyze up to about 500 co-complexes of 
compounds, ligands, or molecular scaffolds bound to protein targets due to the physical 
parameters of the instruments and convenience of operation. If the number of scaffolds that 
have binding activity exceeds a number convenient for the application of crystallography 
methods, the scaffolds can be placed into groups based on having at least one common 
chemical stmcture or other desirable characteristics, and representative compounds can be 
selected from one or more of the classes. Classes can be made with increasingly exacting 
criteria until a desired number of classes (e.g., 500) is obtained. The classes can be based 
on chemical stmcture similarities between molecular scaffolds in the class, e.g., all possess 
a pyrrole ring, benzene ring, or other chemical feature. Likewise, classes can be based on 
shape characteristics, e.g., space-filling characteristics. 

[0304] The co-crystallography analysis can be performed by co-complexing each scaffold 
with its target at concentrations of the scaffold that showed activity in the screening assay. 
This co-complexing can be accomplished with the use of low percentage organic solvents 
with the target molecule and then concentrating the target with each of the scaffolds. In 
preferred embodiments these solvents are less than 5% organic solvent such as dimethyl 
sulfoxide (DMSO), ethanol, methanol, or ethylene glycol in water or another aqueous 
solvent. Each scaffold complexed to the target molecule can then be screened with a 
suitable number of crystallization screening conditions at both 4 and 20 degrees. In 
preferred embodiments, about 96 crystallization screening conditions can be performed in 
order to obtain sufficient information about the co-complexation and crystallization 
conditions, and the orientation of the scaffold at the binding site of the target molecule. 
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Crystal structures can then be analyzed to determine how the bound scaffold is oriented 
physically within the binding site or within one or more binding pockets of the molecular 
family member. 

[0305] It is desirable to determine the atomic coordinates of the compounds bound to the 
target proteins in order to determine which is a most suitable scaffold for the protein family. 
X-ray crystallographic analysis is therefore most preferable for determining the atomic 
coordinates. Those compounds selected can be further tested with the application of 
medicinal chemistry. Compoimds can be selected for medicinal chemistry testing based on 
their binding position in the target molecule. For example, when the compound binds at a 
binding site, the compovmd's binding position in the binding site of the target molecule can 
be considered with respect to the chemistry that can be performed on chemically tractable 
structures or sub-structures of the compound, and how such modifications on the compound 
might interact with structures or sub-structures on the binding site of the target. Thus, one 
can explore the binding site of the target and the chemistry of the scaffold in order to make 
decisions on how to modify the scaffold to arrive at a ligand with higher potency and/or 
selectivity. This process allows for more direct design of ligands, by utilizing structural and 
chemical infbrmation obtained directly firom the co-complex, thereby enabling one to more 
efficiently and quickly design lead compounds that are likely to lead to beneficial drug 
products. In various embodiments it may be desirable to perform co-crystallography on all 
scaffolds that bind, or only those that bind with a particular affinity, for example, only those 
that bind with high affinity, moderate affinity, low affinity, very low affinity, or extremely 
low affinity. It may also be advantageous to perform co-crystallography on a selection of 
scaffolds that bind with any combination of affinities. 

[0306] Standard X-ray protein diffi-action studies such as by using a Rigaku RU-200® 
(Rigaku, Tokyo, Japan) with an X-ray imaging plate detector or a synchrotron beam-line 
can be performed on co-crystals and the diffiraction data measured on a standard X-ray 
detector, such as a CCD detector or an X-ray imaging plate detector. 

[0307] Performing X-ray crystallography on about 200 co-crystals should generally lead 
to about 50 co-crystals structures, which should provide about 10 scaffolds for validation in 
chemistry, which should finally result in about 5 selective leads for target molecules. 
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Virtual Assays 

[0308] Coimnercially available software that generates three-dimensional graphical 
representations of the complexed target and compound Jfrom a set of coordinates provided 
can be used to illustrate and study how a compound is oriented when bound to a target, 
(e.g., QUANTA®, Accelerys, San Diego, CA). Thus, the existence of binding pockets at the 
binding site of the targets can be particularly useful in the present invention. These binding 
pockets are revealed by the crystallographic structure determination and show the precise 
chemical interactions involved in binding the compound to the binding site of the target. 
The person of ordinary skill will realize that the illustrations can also be used to decide 
where chemical groups might be added, substituted, modified, or deleted from the scaffold 
to enhance binding or another desirable effect, by considering where imoccupied space is 
located in the complex and which chemical substructures might have suitable size and/or 
charge characteristics to fill it The person of ordinary skill will also realize that regions 
within the binding site can be flexible and its properties can change as a result of scaffold 
binding, and that chemical groups can be specifically targeted to those regions to achieve a 
desired effect. Specific locations on the molecular scaffold can be considered with 
reference to where a suitable chemical subsfcructure can be attached and in which 
conformation, and which site has the most advantageous chemistry available. 

[0309] An understanding of the forces that bind the compounds to the target proteins 
reveals which compounds can most advantageously be used as scaffolds, and which 
properties can most effectively be manipulated in the design of ligands. The person of 
ordinary skill will realize that steric, ionic, hydrogen bond, and other forces can be 
considered for their contribution to the maintenance or enhancement of the target-compound 
complex. Additional data can be obtained with automated computational methods, such as 
docking and/or Free Energy Perturbations (FEP), to account for other energetic effects such 
as desolvation penalties. The compoimds selected can be used to generate information 
about the chemical interactions with tiie target or for elucidating chemical modifications 
that can enhance selectivity of binding of the compoimd. 

[0310] Computer models, such as homology models {i.e., based on a known, 
experimentally derived structure) can be constructed using data fi-om the co-crystal 
stmctures. When the target molecule is a protein or enzyme, preferred co-crystal stmctures 
for making homology models contain high sequence identity in the binding site of the 
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protein sequence being modeled, and the proteins will preferentially also be within the same 
class and/or fold family. Knowledge of conserved residues in active sites of a protein class 
can be used to select homology models that accurately represent the binding site. 
Homology models can also be used to map stmctural information from a surrogate protein 
where an apo or co-crystal structure exists to the target protein. 

[0311] Virtual screening methods, such as docking, can also be used to predict the binding 
configuration and affinity of scaffolds, compounds, and/or combinatorial Hbrary members 
to homology models. Using this data, and carrying out "virtual experiments'* using 
computer software can save substantial resources and allow the person of ordinary skill to 
make decisions about which compounds can be suitable scaffolds or ligands, without having 
to actually synthesize the ligand and perform co-crystallization. Decisions thus can be 
made about which compounds merit actual synthesis and co-crystallization. An 
understanding of such chemical interactions aids in the discovery and design of drugs that 
interact more advantageously with target proteins and/or are more selective for one protein 
family member over otiiers. Thus, applying these principles, compounds with superior 
properties can be disco' ered. 

[0312] Additives that prot lote co-crystallization can of course be included in the target 
molecule formulation in older io e ahance the formation of co-crystals. In the case of 
proteins or enzymes, the scaffold \:o be tested can be added to the protein formulation, which 
is preferably present at a concentr.\ticn of approximately 1 mg/ml. The formulation can also 
contain between 0%-I0% (v/v) organic solvent, e.g. DMSO, methanol, ethanol, propane 
diol, or 1,3 dimethyl propane diol (MPD) or some combination of those organic solvents. 
Compounds are preferably solubilized in the organic solvent at a concentration of about 10 
mM and added to the protein sample at a concentration of about 100 mM. The protein- 
compound complex is then concentrated to a final concentration of protein of from about 5 
to about 20 mg/mL The complexation and concentration steps can conveniently be 
performed using a 96-well formatted concentration apparatus (e.g., Amicon Inc., 
Piscataway, NJ). Buffers and other reagents present in the formulation being crystallized 
can contain oflier components that promote crystallization or are compatible with 
crystallization conditions, such as DTT, propane diol, glycerol. 

[03131 The crystalhzation experiment can be set-up by placing small ahquots of the 
concentrated protein-compound complex (1 ^il) in a 96 well format and sampling under 96 
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crystallization conditions. (Other screening formats can also be used, e.g., plates with 
greater than 96 wells.) Crystals can typically be obtained using standard crystallization 
protocols that can involve the 96 well crystallization plate being placed at different 
temperatures. Co-crystallization varying factors other than temperature can also be 
considered for each protein-compound complex if desirable. For example, atmospheric 
pressure, the presence or .absence of light or oxygen, a change in gravity, and many other 
variables can all be tested. The person of ordinary skill in the art will realize other variables 
that can advantageously be varied and considered. 

Ligand Design and Preparation 

[0314J The design and preparation of ligands can be performed with or without structural 
and/or co-crystallization data by considering the chemical structures in common between 
the active scaffolds of a set. hi this process structure-activity hypotheses can be formed and 
those chemical structures found to be present in a substantial number of the scaffolds, 
including those that bind Avith low affinity, can be presumed to have some effect on the 
binding of the scaffold. This binding can be presumed to induce a desired biochemical 
effect when it occurs in a biological system (e.g., a treated mammal). New or modified 
scaffolds or combinatorial libraries derived from scaffolds can be tested to disprove the 
maximum number of binding and/or structure-activity hypotheses. The remaining 
hypotheses can then be used to design Ugands that achieve a desired binding and 
biochemical effect. 

[0315] But in many cases it will be preferred to have co-crystallography data for 
consideration of how to modify the scaffold to achieve the desired binding effect (e.g., 
binding at higher affinity or with higher selectivity). Using the case of proteins and 
enzymes, co-crystallography data shows the binding pocket of the protein with the 
molecular scaffold bound to the binding site, and it will be apparent that a modification can 
be made to a chemically tractable group on the scaffold. For example, a small volume of 
space at a protein binding site or pocket might be filled by modifying the scaffold to include 
a small chemical group that fills the volume. Filling the void volume can be expected to 
result in a greater binding affinity, or the loss of undesirable binding to another member of 
the protein family. Similarly, the co-crystallography data may show that deletion of a 
chemical group on the scaffold may decrease a hindrance to binding and result in greater 
binding affinity or specificity. 
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[0316] It can be desirable to take advantage of the presence of a charged chemical group 
located at the binding site or pocket of the protein. For example, a positively charged group 
can be complemented with a negatively charged group introduced on the molecular 
scaffold- This can be expected to increase binding affinity or binding specificity, thereby 
resulting in a more desirable ligand. In many cases, regions of protein binding sites or 
pockets are known to vary fix)m one family member to another based on the amino acid 
differences in those regions. Chemical additions in such regions can result in the creation or 
elimination of certain interactions (e.g., hydrophobic, electrostatic, or entropic) tibat allow a 
compound to be more specific for one protein target over another or to bind with greater 
affinity, thereby enabling one to synthesize a compound with greater selectivity or affinity 
for a particular family member. Additionally, certain regions can contain amino acids that 
are known to be more flexible than others. This often occurs in amino acids contained in 
loops connecting elements of the secondary structure of the protein, such as alpha helices or 
beta strands. Additions of chemical moieties can also be directed to these flexible regions 
in order to increase the likelihood of a specific interaction occurring between the protein 
target of interest and the compound. Virtual screening methods can also be conducted in 
silico to assess the effect of chemical additions, subtractions, modifications, and/or 
substitutions on compoimds with respect to members of a protein family or class. 

[03171 The addition, subtraction, or modification of a chemical structure or sub-structure 
to a scaffold can be performed with any suitable chemical moiety. For example the 
following moieties, which are provided by way of example and are not intended to be 
limiting, can be utilized: hydrogen, alkyl, alkoxy, phenoxy, alkenyl, alkynyl, phenylalkyl, 
hydroxyaUcyl, haloalkyl, aryl, arylalkyl, alkyloxy, alkylfliio, alkenylthio, phenyl, 
phenylalkyl, phenylalkylthio, hydroxyalkyl-thio, alkylthiocarbbamylthio, cyclohexyl, 
pyridyl, piperidinyl, alkylamino, amino, nitro, mercapto, cyano, hydroxyl, a halogen atom, 
halomethyl, an oxygen atom (e.g., forming a ketone or N-oxide) or a sulphur atom (e.g., 
forming a thiol, thione, di-alkylsulfoxide or sulfone) are all examples of moieties that can be 
utilized, 

[0318] Additional examples of structures or sub-stractures that may be utilized are an aryl 
optionally substituted with one, two, or three substituents independently selected fi-om the 
group consisting of alkyl, alkoxy, halogen, trihalomethyl, carboxylate, carboxamide, nitro, 
and ester moieties; an amine of formula -NX2X3, where X2 and X3 are independently 
selected fi-om the group consisting of hydrogen, saturated or unsaturated alkyl, and 
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homocyclic or heterocyclic ring moieties; halogen or trihalomethyl; a ketone of formula - 
COX4, where X4 is selected from the group consisting of alkyl and homocyclic or 
heterocyclic ring moieties; a carboxylic acid of formula -(X5)nCOOH or ester of formula 
(X6)nCOOX7, where X5, Xe, and X7 and are independently selected from the group 
consisting of alkyl and homocyclic or heterocyclic ring moieties and where n is 0 or 1; an 
alcohol of formula (X8)nOH or an alkoxy moiety of formula -(X8)nOX9, where Xg and X9 
are independently selected from the group consisting of saturated or unsaturated alkyl and 
homocyclic or heterocyclic ring moieties, wherein said ring is optionally substituted with 
one or more substituents independently selected from the group consisting of alkyl, alkoxy, 
halogen, trihalomethyl, carboxylate, nitro, and ester and where n is 0 or 1; an amide of 
formula NHCOXio, where Xio is selected from the group consisting of alkyl, hydroxyl, and 
homocyclic or heterocyclic ring moieties, wherein said ring is optionally substituted with 
one or more substituents independently selected from the group consisting of alkyl, alkoxy, 
halogen, trihalomethyl, carboxylate, nitro, and ester; SO2, NXn X12, where Xn and X12 are 
selected from the group consisting of hydrogen, alkyl, and homocyclic or heterocyclic ring 
moieties; a homocyclic or bete -ocyclic ring moiety optionally substituted with one, two, or 
three substituents independqnti selected from the group consisting of alkyl, alkoxy, 
halogen, trihalomethyl, carboxj^rt*. , carboxamide, nitro, and ester moieties; an aldehyde of 
formula -CHO; a sulfone of formula -^:O^Xl3, where X13 is selected from the group 
consisting of saturated or unsaturated ^Ik/i and homocyclic or heterocyclic ring moieties; 
and a nitro of formula -NO2. 

Identification of Attachment Sites on Molecular Scaffolds and Ligands 

[031 9J In addition to the identification and development of ligands for kinases and other 
enzymes, determination of the orientation of a molecular scafFold or other binding 
compound in a binding site allows identification of energetically allowed sites for 
attachment of the binding molecule to another component. For such sites, any free energy 
change associated with the presence of the attached component should not destablize the 
binding of the compound to the kinase to an extent that will disrupt the binding. Preferably, 
the binding energy with the attachment should be at least 4 kcal/moL, more preferably at 
least 6, 8, 10, 12, 15, or 20 kcal/moL Preferably, the presence of the attachment at the 
particular site reduces binding energy by no more than 3, 4, 5, 8, 10, 12, or 15 kcal/moL 

[0320] In many cases, suitable attachment sites will be those that are exposed to solvent 
when the binding compound is bound in the binding site. In some cases, attachment sites 
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can be used that will result in small displacements of a portion of the enzyme without an 
excessive energetic cost. Exposed sites can be identified in various ways. For example, 
exposed sites can be identified using a graphic display or 3-dimensional model. In a 
graphic display, such as a computer display, an image of a compound bound in a binding 
site can be visually inspected to reveal atoms or groups on the compound that are exposed 
to solvent and oriented such that attachment at such atom or group would not preclude 
binding of the enzyme and binding compomd. Energetic costs of attachment can be 
calculated based on changes or distortions that would be caused by the attachment as well as 
entropic changes. 

[0321] Many different types of componejats can be attached. Persons with skill are 
familiar with the chemistries used for various attachments. Examples of components that 
can be attached include, without limitation: solid phase components such as beads, plates, 
chips, and wells; a direct or indirect label; a linker, which may be a traceless linker; among 
others. Such linkers can themselves be attached to other components, e.g:, to solid phase 
media, labels, and/or binding moieties. 

[0322] The binding energy of a compound and the effects on binding energy for attaching 
the molecule to another component can be calculated approximately using any of a variety 
of available software or by manual calculation. An example is the following: 

[0323] Calculations were performed to estimate binding energies of different organic 
molecules to two Kinases: Pim-1 and CDIC2. The organic molecules considered included 
Staurosporine, identified compounds that bind to PIM-1, and several linkers. 

[0324] Calculated binding energies between protein-ligand complexes were obtained 
using the FlexX score (an implementation of the Bohm scoring function) within the Tripos 
software suite. The form for that equation is shown in Eqn. 1 below: 

AGbind = AGtr + AGhb + AGion + AGlipo + AGarom + AGrot 

[0325] where: AGtr is a constant term that accounts for the overall loss of rotational and 
translational entropy of the ligand, AGhb accounts for hydrogen bonds formed between the 
ligand and protein, AGion accoimts for the ionic interactions between the ligand and protein, 
AGlipo accounts for the lipophilic interaction that corresponds to the protein-ligand contact 
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surface, AGarom accounts for interactions between aromatic rings in the protein and ligand, 
and AGrot accounts for the entropic penalty of restricting rotatable bonds in the ligand upon 
binding. 

[0326] This method estimates the free energy that a lead compound should have to a 
target protein for which there is a crystal structure, and it accounts for the entropic penalty 
of flexible linkers. It can therefore be used to estimate the free energy penalty incurred by 
attaching linkers to molecules being screened and the binding energy that a lead compound 
should have in order to overcome the free energy penalty of the linker. The method does 
not account for solvation and the entropic penalty is likely overestimated for cases where 
the linker is bound to a solid phase through another binding complex, such as a 
biotin:streptavidin complex. 

[0327] Co-crystals were aligned by superimposing residues of PlM-1 with corresponding 
residues in CDK2. The PIM-1 structure used for these calculations was a co-crystal of 
PIM-1 with a binding compound. The CDK2:Staurosporine co-crystal used was from the 
Brookhaven database file laql. Hydrogen atoms were added to the proteins and atomic 
charges were assigned using the AMBER95 parameters within Sybyl. Modifications to the 
compounds described were made within the Sybyl modeling suite from Tripos. 

[0328] These calculations indicate that the calculated binding energy for compounds that 
bind strongly to a given target (such as Staurosporine:CDK2) can be lower than —25 
kcal/mol, while the calculated binding affinity for a good scaffold or an imoptimized 
binding compound can be in the range of -15 to -20. The free energy penalty for attachment 
to a linker such as the ethylene glycol or hexatriene is estimated as typically being in the 
range of +5 to +15 kcal/mol. 

Linkers 

[0329] Linkers suitable for use in the invention can be of many different types. Linkers 
can be selected for particular applications based on factors such as linker chemistry 
compatible for attachment to a binding compound and to another component utilized in the 
particular application. Additional factors can include, without limitation, linker length, 
linker stability, and ability to remove the linker at an appropriate time. Exemplary linkers 
include, but are not limited to, hexyl, hexatrienyl, ethylene glycol, and peptide linkers. 
Traceless linkers can also be used, e.g.^ as described in Plunkett, M. J., and EUman, J. A., 
(1995), J. P*-"- '^^^'M <^n-^nnA 
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[0330] Typical functional groups, that are utilized to link binding compound(s), include, 
but not limited to, carboxylic acid, amine, hydroxyl, and thiol. (Examples can be found in 
Solid-supported combinatorial and parallel synthesis of small molecular weight compound 
libraries; (1998) Tetrahedron organic chemistry series Vol. 17; Pergamon; p85). 

Labels 

[0331] As indicated above, labels can also be attached to a binding compound or to a 
linker attached to a binding compound. Such attachment may be direct (attached directly to 
the binding compound) or indirect (attached to a component that is directly or indirectly 
attached to the binding compound). Such labels allow detection of the compound either 
directly or indirectly. Attachment of labels can be performed using conventional 
chemistries. Labels can include, for example, fluorescent labels, radiolabels, light scattering 
particles, ligjat absorbent particles, magnetic particles, enzymes, and specific binding agents 
(e.g-, biotin or an antibody target moiety). 

Solid Phase Media 

[0332] Additional examples of components that can be attached directly or indirectly to a 
binding compound include various solid phase media. Similar to attachment of linkers and 
labels, attachment to solid phase media can be performed using conventional chemistries. 
Such solid phase media can include, for example, small components such as beads, 
nanoparticles, and fibers {e.g., in suspension or in a gel or chromatographic matrix). 
Likewise, solid phase media can include larger objects such as plates, chips, slides, and 
tubes. In many cases, the binding compound will be attached in only a portion of such an 
objects, e.g.y in a spot or other local element on a generally flat surface or in a well or 
portion of a well. 

Identification of Biological Agents 

[0333] The possession of structural information about a protein also provides for the 
identification of usefiil biological agents, such as epitopes for development of antibodies, 
identification of mutation sites expected to affect activity, and identification of attachment 
sites allowing attachment of the protein to materials such as labels, linkers, peptides, and 
solid phase media. 

[0334J Antibodies (Abs) finds multiple applications in a variety of areas including 
biotechnology, medicine and diagnosis, and indeed they are one of the most powerful tools 
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for life science research. Abs directed against protein antigens can recognize either linear or 
native three-dimensional (3D) epitopes. The obtention of Abs that recognize 3D epitopes 
require the use of whole native protein (or of a portion that assumes a native conformation) 
as immunogens. Unfortvmately, this not always a choice due to various technical reasons: 
for example the native protein is just not available, the protein is toxic, or its is desirable to 
utilize a high density antigen presentation. In such cases, immunization with peptides is the 
alternative. Of course, Abs generated in this manner will recognize linear epitopes, and they 
might or might not recognize the source native protein, but yet they will be useful for 
standard laboratory applications such as western blots. The selection of peptides to use as 
immunogens can be accomplished by following particular selection rules and/or use of 
epitope prediction software. 

[0335] Though methods to predict antigenic peptides are not infallible, there are several 
mles that can be followed to determine what peptide fragments from a protein are likely to 
be antigenic. These rales are also dictated to increase the likelihood that an Ab to a 
particular peptide will recognize the native protein. 

• 1 . Antigenic peptides should be located in solvent accessible regions and contain 
both hydrophobic and hydrophilic residues. 

o For proteins of known 3D structure, solvent accessibility can be determined 
using a variety of programs such as DSSP, NACESS, or WHATIF, among 
others. 

o If the 3D structure is not known, use any of the following web servers to 
predict accessibilities: PHD, JPRED, PredAcc (c) ACCoro 

• 2. Preferably select peptides lying in long loops connecting Secondary Structure 
(SS) motifs, avoiding peptides located in helical regions. This will increase the odds 
that the Ab recognizes the native protein. Such peptides can, for example, be 
identified from a crystal structure or crystal structure-based homology model. 

o For protein with known 3D coordinates, SS can be obtained from the 
sequence link of the relevant entry at the Brookhaven data bank . The 
PDBsum s erver also offer SS analysis of pdb records. 
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o When no structure is available secondary structure predictions can be 
obtained jfrom any of the following servers: PHD. JPRED. PSI-PRED. 
NNSP. etc 

• 3. When possible, choose peptides that are in the N- and C-terminaJ region of the 
protein. Because Ihe N~ and C- terminal regions of proteins are usually solvent 
accessible and unstructured, Abs against those regions are also likely to recognize 
the native protein. 

• 4. For cell surface glycoproteins, eliminate from initial peptides those containing 
consensus sites for N-glycosilation. 

o N-glycosilation sites can be detected using Scanprosite, or NetNGlvc 

[0336] hi addition, several methods based on various physio-chemical properties of 
experimental determined epitopes (flexibility, hydrophibihty, accessibiHty) have been 
published for the prediction of antigenic determinants and can be used. The antigenic index 
and Preditop are example. 

[03371 Perhaps the simplest method for the prediction of antigenic determinants is that of 
Kolaskar and Tongaonkar, which is based on the occurrence of amino acid residues in 
experimentally determined epitopes. (Kolaskar and Tongaonkar (1990) A semi-empirical 
method for prediction of antigenic determinants on protein antigens. FEBBS Lett 276(1- 
2): 172-1 74.) The prediction algorithm works as follows: 

• 1. Calculate the average propensity for each overlapping 7-mer and assign the result 
to the central residue (i+3) of the 7-mer. 

• 2. Calculate the average for the whole protein. 

• 3. (a) If the average for the whole protein is above 1 .0 then all residues having 
average propensity above 1.0 are potentially antigenic. 

• 3. (b) If the average for the whole protein is below 1 .0 then all residues having 
above the average for the whole protein are potentially antigenic. 

• 4. Find 8-mers where all residues are selected by step 3 above (6-mers in the original 
paper) 
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[0338] The Kolaskar and Tongaonkar method is also available from the GCG package. 



to alter the activity of the protein. Such residues include, for example, residues that interact 
with substrate, conserved active site residues, and residues that are in a region of ordered 
secondary structure of involved in tertiary interactions. The mutations that are likely to 
affect activity will vary for different molecular contexts. Mutations in an active site that 
will affect activity are typically substitutions or deletions that eliminate a charge-charge or 
hydrogen bonding interaction, or introduce a steric interference. Mutations in secondary 
structure regions or molecular interaction regions that are likely to affect activity include, 
for example, substitutions that alter the hydrophobicity/hydrophihcity of a region, or that 
introduce a sufficient strain in a region near or including the active site so that critical 
residue(s) in the active site are displaced. Such substitutions and/or deletions and/or 
insertions are recognized, and the predicted structural and/or energetic effects of mutations 
can be calculated using conventional software. 

IX. Kinase Activity Assays 

[0340] A number of different assays for kinase activity can be utilized for assaying for 
active modulators and/or determining specificity of a modulator for a particular kinase or 
group or kinases. In addition to the assays mentioned below, one of ordinary skill in the art 
will know of other assays that can be utilized and can modify an assay for a particular 
application. 

[0341] An assay for kinase activity that can be used for kinases, eg., PIM-1, can be 
performed according to the following procedure using purified kinase using myelin basic 
protein (MBP) as substrate. An exemplary assay can use the following materials: MB? 
(M-1891, Sigma); Kinase buffer (KB = HEPES 50 mM, pH7.2, MgChrMnCh (200 pM:200 
pM); ATP (y-^^P):NEG602H (10 mCi/mL)(Perkin-Ehner); ATP as 100 mM stock in kinase 
buffer; EDTA as 100 mM stock solution. 

[0342] Coat scintillation plate suitable for radioactivity counting (e.g. , FlashPlate from 
Perkin-EImer, such as the SMP200(basic)) with kinase+MBP mix (final 100 ng+300 
ng/well) at 90 -nL/well in kinase buffer. Add compounds at 1 |iL/well firom 10 mM stock 
in DMSO. Positive control wells are added with 1 jjJL of DMSO. Negative control wells 




[0339] Crystal structures also allow identification of residues at which mutation is likely 
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are added with 2 jaL of EDTA stock solution. ATP solution (10 iiiL) is added to each well 
to provide a final concentration of cold ATP is 2 |aM, and 50 nCi ATP7[^^P]. The plate is 
shaken briefly, and a count is taken to initiate count (IC) using an apparatus adapted for 
coxmting with the plate selected^ e,g,, Perkin-Elmer Trilux. Store the plate at 37°C for 4 hrs, 
then count again to provide final count (FC). 

(0343J Net 33P incorporation (NI) is calculated as: NI = FC - IC. 

[0344] The effect of the present of a test compound can then be calculated as the percent 
of the positive control as: %PC = [(NI - NC) / (PC - NC)] x 1 00, where NC is the net 
incorporation for the negative control, and PC is the net incorporation for the positive 
control. 

[0345] As indicated above, other assays can also be readily used. For example, kinase 
activity can be measured on standard polystyrene plates, using biotinylated MBP and 
ATP7[^^P] and with Streptavidin-coated SPA (scintillation proximity) beads providing the 
signal. 

[0346] Additional alternative assays can employ phospho-specific antibodies as detection 
reagents with biotinylated peptides as substrates for the kinase. This sort of assay can be 
formatted either in a fluorescence resonance energy transfer (FRET) fomiat, or using an 
AlphaScreen (amplified /uminescent jproximity homogeneous assay) format by varying the 
donor and acceptor reagents that are attached to streptavidin or the phosphor-specific 
antibody. 

X. Organic Synthetic Techniques 

[0347] The versatility of computer-based modulator design and identification lies in the 
diversity of structures screened by the computer programs. The computer programs can 
search databases that contain very large numbers of molecules and can modify modulators 
already complexed with the enzyme with a wide variety of chemical fiinctional groups. A 
consequence of this chemical diversity is that a potential modulator of kinase fimction may 
take a chemical form that is not predictable. A wide array of organic synthetic techniques 
exist in the art to meet the challenge of constructing these potential modulators. Many of 
these organic synthetic methods are described in detail in standard reference sources utilized 
by those skilled in the art. One example of such a reference is March, 1994, Advanced 
Organic Chemistry: Reactions, Mechanisms and Structure, New York, McGraw Hill. Thus, 
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the techniques useful to synthesize a potential modulator of kinase function identified by 
computer-based methods are readily available to those skilled in the art of organic chemical 
synthesis. 

XI. Isomers, Prodrugs, and Active Metabolites 

[0348] The present compoxmds are described herein with generic formulas and specific 
compoimds. La addition, the present compounds may exist in a number of different forms or 
derivatives, all within the scope of the present invention. These include, for example, 
tautomers, enantiomers^, stereoisomers, racemic mixtures, regioisomers, salts, prodrugs (e.g., 
carboxylic acid esters), solvated forms, different crystal forms or polymorphs, and active 
metabolites 

A. Tautomers, Stereoisomers, Regioisomers, and Solvated Forms 

[0349] It is understood that certain compounds may exhibit tautomerism. in such cases, 
the formula drawings within this specification expressly depict only one of the possible 
tautomeric forms. It is therefore to be understood that within the invention the fomvulas are 
intended to represent any tautomeric form of the dv picted compounds and are not to be 
limited merely to the specific tautomeric form depicceii ty the formula drawings. 

[0350) Likewise, some of the present compounds may 4 oii :ain one or more chiral centers 
and therefore, may exist in two or more stereoisomeric forai i. Thus, such compounds may 
be present as single stereoisomers (i.e., essentially free of other stereoisomers), racemates, 
and/or mixtures of enantiomers and/or diastereomers. All such single stereoisomers^ 
racemates and mixtures thereof are intended to be within the scope of the present invention. 
Unless specified to the contrary, all such steroisomeric forms are included within the 
formulas provided herein. 

[0351] In certain embodiments, a chiral compound of the present invention is in a form 
that contains at least 80% of a single isomer (60% enantiomeric excess ("e.e.") or 
diastereomeric excess ("d.e.'*)), or at least 85% (70% e.e. or d.e.), 90% (80% e.e. or d.e.), 
95% (90% e.e. or d.e.), 97.5% (95% e.e. or d.e.), or 99% (98% e.e. or d.e,). As generally 
understood by those skilled in the art, an optically pm-e compound having one chiral center 
is one that consists essentially of one of the two possible enantiomers (i.e., is 
enantiomerically piure), and an optically pure compound having more than one chiral center 
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is one that is both diastereomericaliy piire and enantiomerically pure. In certain 
embodiments, the comig^^f^ present in optically pure form. 

[0352] For compounds is which synthesis involves addition of a single group at a double 
bond, particularly a carbon-carbon double bond, the addition may occur at either of the 
double bond-linked atoms. For such compounds, the present invention includes both such 
regioisomers- 

[0353] Additionally, the formulas are intended to cover solvated as well as unsolvated 
forms of the identified structures- For example, the indicated stmctures include both both 
hydrated and non-hydrated forms. Other examples of solvates include the stmctures in 
combination with isopropanol, ethanol, methanol, DMSO, ethyl acetate, acetic acid, or 
ethanolamine. 

B. Prodrugs and Metabolites 

[0354] In addition to the present formulas and compounds described herein, the mvention 
also includes prodrugs (generally pharmaceutically acceptable prodrags), active metabolic 
derivatives (active metabolites), and their pharmaceutically acceptable salts. 

[0355] In this context, prodmgs are compounds that may be converted under 
physiological conditions or by solvolysis to the specified compound or to a 
pharmaceutically acceptable salt of such a compound. A common example is an alkyl ester 
of a carboxylic acid. 

[0356] As described in The Practice of Medicinal Chemistry, Ch. 31-32 (Ed. Wermuth, 
Academic Press, San Diego, CA, 2001), prodrugs can be conceptually divided into two non- 
exclusive categories, bioprecursor prodrugs and carrier prodrugs. Generally, bioprecursor 
prodrugs are compounds are inactive or have low activity compared to the corresponding 
active drug compound, that contain one or more protective groups and are converted to an 
active form by metaboUsm or solvolysis. Both the active drag form and any released 
metabolic products should have acceptably low toxicity. Typically, the formation of active 
drug compound involves a metaboUc process or reaction that is one of the follow types: 

[0357] Oxidative reactions, such as oxidation of alcohol, carbonyl, and acid fimctions, 
hydroxylation of aliphatic carbons, hydroxylation of alicyclic carbon atoms, oxidation of 
aromatic carbon atoms, oxidation of carbon-carbon double bonds, oxidation of nitrogen- 
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containing functional groups, oxidation of silicon, phosphorus^ arsenic, and sulfur, oxidative 
N-delakylation, oxidative O- and S-delakylation, oxidative deanndnation, as well as otr^er 
oxidative reactions. 

[0358] Reductive reactions, such as reduction of carbonyl groups, reduction of alcoholic 
groups and carbon-carbon double bonds, reduction of nitrogen-containing functions groups, 
and other reduction reactions. 

[0359] Reactions without change in the state of oxidation, such as hydrolysis of esters and 
ethers, hydrolytic cleavage of carbon-nitrogen single bonds, hydrolytic cleavage of non- 
aromatic heterocycles, hydration and dehydration at multiple bonds, new atomic linkages 
resulting from dehydration reactions, hydrolytic dehalogenation, removal of hydrogen 
halide molecule, and other such reactions. 

[0360] Carrier prodrugs are drug compounds that contain a transport moiety, e.g., that 
improves uptake and/or localized delivery to a site(s) of action. Desirably for such a carrier 
prodrug, the linkage between the drug moiety and the transport moiety is a covalent bond, 
the prodrug is inactive or less active than the dmg compound, the prodrug and any release 
transport moiety are acceptably non-toxic. For prodrugs where the transport moiety in 
intended to enhance uptake, typically the release of the transport moiety should be rapid. In 
other cases, it is desirable to utilize a moiety that provides slow release, e.g., certain 
polymers or other moieties, such as cyclodextrins. (See, e.g., Cheng et al., U.S. Patent publ. 
20040077595, appL 10/656,838, incorporated herein by reference.) Such carrier prodrags 
are often advantageous for orally administered drugs. Carrier prodrags can, for example, be 
used to improve one or more of the following properties: increased lipophilicity, increased 
duration of pharmacological effects, increased site-specificity, decreased toxicity and 
adverse reactions, and/or improvement in drug formulation (e.g., stability, water solubility, 
suppression of an undesirable organoleptic or physiochemical property). For example, 
lipophilicity can be increased by esterification of hydroxyl groups with lipophilic carboxylic 
acids, or of carboxylic acid groups with alcohols, e.g., aliphatic alcohols. Wermuth, The 
Practice of Medicinal Chemistry , Ch. 31-32, Ed. Wermuth, Academic Press, San Diego, 
CA, 2001. 

[0361] Prodrugs may proceed from prodrug form to active form in a single step or may 
have one or more intermediate forms which may themselves have activity or may be 
inactive. 
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[0362] Metabolites, e.g., active metabolites overlap with prodrugs as described above, 
e.g., bioprecursor prodrugs. Thus, such metabolites are pharmacologically active 
compounds or compounds that further metabolize to pharmacologically active compounds 
that are derivatives resulting from metabolic process in the body of a subject or patient. Of 
these, active metabolites are such pharmacologically active derivative compounds. For 
prodrugs, the prodrug compounds is generally inactive or of lower activity than the 
metabolic product. For active metabolites, the parent compound may be either an active 
compound or may be an inactive prodrug. 

[0363] Prodrugs and active metabolites may be identified using routine techniques know 
in the art. See, e.g., Bertolini et al, 1997, J Med Chem 40:201 1-2016; Shan et al., JPharm 
Sci 86:756-757; Bagshawe, 1 995, Drug Dev Res 34:220-230; Wermuth, The Practice of 
Medicinal Chemistry. Ch. 31-32, Academic Press, San Diego, CA, 2001. 

C. Pharmaceuticaily acceptable salts 

[0364] Compounds can be formulated as or be in the fonii of pharmaceuticaily acceptable 
salts. Pharmaceuticaily acceptable salts are nontoxic sa s in the amounts and 
concentrations at which they are administered. The prepa;a^'oB oi'such salts can facilitate 
the pharmacological use by altering the physical charactenstic,:;; ^. f a compound without 
preventing it from exerting its physiological effect. UseM aJteratk as in physical properties 
include lowering the melting point to facilitate transmucosal adm h istration and increasing 
the solubility to facilitate administering higher concentrations of the dru g. 

[0365] Pharmaceuticaily acceptable salts include acid addition salts such as those 
containing sulfate, chloride, hydrochloride, fumarate, maleate, phosphate, sulfamate, 
acetate, citrate, lactate, tartrate, methanesulfonate, ethanesulfonate, benzenesulfonate,/>- 
toluenesulfonate, cyclohexylsulfamate and quinate. Pharmaceuticaily acceptable salts can 
be obtained from acids such as hydrochloric acid, maleic acid, sulfiiric acid, phosphoric 
acid, sulfamic acid, acetic acid, citric acid, lactic acid, tartaric acid, malonic acid, 
methanesulfonic acid, ethanesulfonic acid, benzenesulfonic acid,77-toluenesulfonic acid, 
cyclohexylsulfamic acid, fumaric acid, and quinic acid. 

[0366] Pharmaceuticaily acceptable salts also include basic addition salts such as those 
containing benzathine, chloroprocaine, choline, diethanolamine, ethylenediamine, 
meglumine, procaine, aluminum, calcium, lithiiun, magnesium, potassium, sodium. 
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ammonium, alkylamine, and zinc, when acidic functional groups, such as carboxyhc acid or 
phenol are present. For example, see Remington^s Pharmaceutical Sciences . 19^^ ed.. Mack 
Publishing Co., Easton, PA, Vol. 2, p. 1457, 1995. Such salts can be prepared using the 
appropriate corresponding bases. 

[0367] Pharmaceutically acceptable salts can be prepared by standard techniques. For 
example, the free-base form of a compound is dissolved in a suitable solvent, such as an 
aqueous or aqueous-alcohol in solution containing the appropriate acid and then isolated by 
evaporating the solution. In another example, a salt is prepared by reacting the free base 
and acid in an organic solvent. 

[0368] Thus, for example, if the particular compoamd is a base, the desired 
pharmaceutically acceptable salt may be prepared by any suitable method available in the 
art, for example, treatment of the free base v^ith an inorganic acid, such as hydrochloric 
acid, hydrobromic acid, sulfuric acid, nitric acid, phosphoric acid and the like, or with an 
organic acid, such as acetic acid, maleic acid, succinic acid, mandelic acid, fumaric acid, 
malonic acid, pyruvic acid, oxalic acid, glycolic acid, salicylic acid, a pyranosidyl acid, such 
as glucuronic acid or galacturonic acid, an alpha-hydroxy acid, such as citric acid or tartaric 
acid, an amino acid, such as aspartic acid or glutamic acid, an aromatic acid, such as 
benzoic acid or cinnamic acid, a sulfonic acid, such as p-toluenesulfonic acid or 
ethanesulfonic acid, or the like. 

[03691 Similarly, if the particular compound is an acid, the desired pharmaceutically 
acceptable salt may be prepared by any suitable method, for example, treatment of the free 
acid with an inorganic or organic base, such as an amine (primary, secondary or tertiary), an 
alkali metal hydroxide or alkaline earth metal hydroxide, or the like. Illustrative examples 
of suitable salts include organic salts derived from amino acids, such as glycine and 
arginine, ammonia, primary, secondary, and tertiary amines, and cyclic amines, such as 
piperidine, morpholine and piperazine, and inorganic salts derived from sodium, calcium, 
potassium, magnesium, manganese, iron, copper, zinc, aluminum and lithium. 

[0370] The pharmaceutically acceptable salt of the different compounds may be present as 
a complex. Examples of complexes include 8-chlorotheophylline complex (analogous to, 
e.g., dimenhydrinate: diphenhydramine 8-chlorotheophylline (1:1) complex; Dramamine) 
and various cyclodextrin inclusion complexes. 
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[0371] Unless specified to the contrary, specification of a compound herein includes 
pharmaceutically acceptable salts of such compound. 

Polymorphic forms 

[0372] In the case of agents that are soUds, it is understood by those skilled in the art that 
the compounds and salts may exist in different crystal or polymorphic forms, all of which 
are intended to be within the scope of the present invention and specified formulas. 

XII. Administration 

[0373] The methods and compounds will typically be used in therapy for human patients. 
However, they may also be used to treat similar or identical diseases in other vertebrates 
such as other primates, sports animals, and pets such as horses, dogs and cats. 

[0374] Suitable dosage forms, in part, depend upon the use or the route of administration, 
for example, oral, transdermal, transmucosal, or by injection (parenteral). Such dosage 
forms should allow the compound to reach target cells. Other factors are well known in the 
art, and include considerations such as toxicity and dosage forms that retard the compound 
or composition from exerting its effects. Techniques and formulations generally may be 
found in Remington's Pharmaceutical Sciences, 18^ ed.. Mack PubUshing Co., Easton, PA, 
1990 (hereby incorporated by reference herein). 

[0375] Carriers or excipients can be used to produce pharmaceutical compositions. The 
carriers or excipients can be chosen to facilitate administration of the compound. Examples 
of carriers include calcium carbonate, calcium phosphate, various sugars such as lactose, 
glucose, or sucrose, or types of starch, cellulose derivatives, gelatin, vegetable oils, 
polyethylene glycols and physiologically compatible solvents. Examples of physiologically 
compatible solvents include sterile solutions of water for injection (WFI), saline solution, 
and dextrose. 

[0376] The compounds can be admixiistered by different routes including intravenous, 
intraperitoneal, subcutaneous, intramuscular, oral, transmucosal, rectal, or transdermal. 
Oral administration is preferred. For oral administration, for example, the compounds can 
be formulated into conventional oral dosage forms such as capsules, tablets, and liquid 
preparations such as syrups, elixirs, and concentrated drops. 
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[0377] Pharmaceutical preparations for oral use can be obtained, for example, by 
combining the active compounds with solid excipients, optionally grinding a resuhing 
mixture, and processing the mixture of granules, after adding suitable auxiharies, if desired, 
to obtain tablets or dragee cores. Suitable excipients are, in particular, fillers such as sugars, 
including lactose, sucrose, mannitol, or sorbitol; cellulose preparations, for example, maize 
starch, wheat starch, rice starch, potato starch, gelatin, gum tragacanth, methyl cellulose, 
hydroxypropyhnethyl-cellulose, sodium carboxymethylcellulose (CMC), and/or 
polyvinylpyrrolidone (PVP: povidone). If desired, disintegrating agents maybe added, such 
as the cross— linked polyvinylpyrrolidone, agar, or alginic acid, or a salt thereof such as 
sodium alginate. 

[0378] Dragee cores are provided with suitable coatings. For this purpose, concentrated 
sugar solutions may be used, which may optionally contain, for example, gum arabic, talc, 
poly-vinylpyrrolidone, carbopol gel, polyethylene glycol (PEG), and/or titanium dioxide, 
lacquer solutions, and suitable organic solvents or solvent mixtures. Dye-stufiFs or pigments 
may be added to the tablets or dragee coatings for identification or to characterize different 
combinations of active compound doses. 

[0379] Pharmaceutical preparations that can be used orally include push-fit capsules made 
of gelatin ("gelcaps"), as well as sofl:, sealed capsules made of gelatin, and a plasticizer, 
such as glycerol or sorbitol. The push-fit capsules can contain the active ingredients in 
admixture with filler such as lactose, binders such as starches, and/or lubricants such as talc 
or magnesium stearate and, optionally, stabilizers. In soft capsules, the active compounds 
may be dissolved or suspended in suitable liquids, such as fatty oils, Kquid paraffin, or 
liquid polyethylene glycols (PEGs). hi addition, stabilizers may be added. 

[0380] Alternatively, injection Cparenteral administration) may be used, e.g., 
intramuscular, intravenous, intraperitoneal, and/or subcutaneous. For injection, the 
compounds of the invention are formulated in sterile liquid solutions, preferably in 
physiologically compatible buffers or solutions, such as saline solution. Hank's solution, or 
Ringer's solution. In addition, the compounds may be formulated in solid form and 
redissolved or suspended immediately prior to use. Lyophihzed forms can also be 
produced. 

[0381] Administration can also be by transmucosal or transdermal means. For 
transmucosal or transdermal administration, penetrants appropriate to the barrier to be 

SUBSTITUTE SHEET (RULE 26) 



wo 2005/028624 



95 



PCT/US2004/030360 



permeated are used in the formulation. Such penetrants are generally known in the art, and 
include, for example, for transmucosal administration, bile salts and fusidic acid derivatives. 
In addition, detergents may be used to facilitate permeation. Transmucosal administration, 
for example, may be through nasal sprays or suppositories (rectal or vaginal). 

[0382] The amounts of various compound to be administered can be determined by 
standard procedures taking into account factors such as the compound ICso^ the biological 
half-life of the compound, the age, size, and weight of the patient, and the disorder 
associated with the patient. The importance of these and other factors are well known to 
those of ordinary skill in the art. Generally, a dose will be between about 0.01 and 50 
mg/kg, preferably 0.1 and 20 mg/kg of the patient being treated. Multiple doses may be 
used. 

XIII. Manipulatioa of Kinase Coding Sequences 

[0383] Through the availability of the coding sequences for many different kinases, any of 
a variety of different molecular techniques can be performed as desired, -e.g., cloning, 
construction of recombinant sequences, production and purification of recombinant protein, 
introduction of particular kinase sequences into other organisms, and the like. 

[0384] Techniques for the manipulation of nucleic acids, such as, e.g., subcloning, 
labeling probes (e.g., random-primer labeling using Klenow polymerase, nick translation, 
amplification), sequencing, hybridization and the like are well disclosed in the scientific and 
patent literature, see, e.g., Sambrook, ed.. Molecular Cloning: a Laboratory Manual (2nd 
ed.). Vols. 1-3, Cold Spring Harbor Laboratory, (1989); Current Protocols in Molecular 
Biology, Ausubel, ed. John Wiley & Sons, Inc., New York (1997); Laboratory Techniques 
in Biochemistry and Molecular Biology: Hybridization With Nucleic Acid Probes, Part L 
Theory and Nucleic Acid Preparation, Tijssen, ed. Elsevier, N.Y. (1993). 
[0100] Nucleic acid sequences can be ampUfied as necessary for further use using 
amplification methods, such as PCR, isothermal methods, rolling circle methods, etc., are 
well known to the skilled artisan. See, e.g., Saiki, "Amplification of Genomic DNA" in 
PCR Protocols, hmis et al., Eds., Academic Press, San Diego, CA 1990, pp 13-20; Wharam 
et al.. Nucleic Acids Res. 2001 Jun 1;29(1 1):E54-E54; Hafiier et al., Biotechniques 2001 
Apr;30(4):852-6, 858, 860 passim; Zhong et al., Biotechniques 2001 Apr;30(4):852-6, 858, 
860 passim. 
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[0385] Nucleic acids, vectors, capsids, polypeptides, and the like can be analyzed and 
quantified by any of a number of general means well known to those of skill in the art. 
These include, e.g., analytical biochemical methods such as NMR, spectrophotometry, 
radiography, electrophoresis, capillary electrophoresis, high performance liquid 
chromatography (HPLC), thin layer chromatography (TLC), and hyperdiffiision 
chromatography, various immunological methods, e.g. fluid or gel precipitin reactions, 
inununodiffusion, immimo-electrophoresis, radioimmunoassays (RIAs), enzyme-linked 
immunosorbent assays (ELISAs), immuno-fluorescent assays. Southern analysis. Northern 
analysis, dot-blot analysis, gel electrophoresis (e.g., SDS-PAGE), nucleic acid or target or 
signal amplification methods, radiolabeling, scintillation counting, and affinity 
chromatography. 

[0386] Obtaining and manipulating nucleic acids used to practice the methods of the 
invention can be performed by cloning from "genomic samples, and, if desired, screening 
and re-cloning inserts isolated or amplified from, e.g., genomic clones or cDNA clones. 
Sources of nucleic acid used in the methods of the invention include g^momic or cDNA 
libraries contained in, e.g., mammalian artificial chromosomes (MACi \ see, e.g., U.S. 
Patent Nos. 5,721,1 18; 6,025,155; human artificial chromosomes, see, t g., RosenfeJd 
(1997) Mj^. Genet 15:333-335; yeast artificial chromosomes (YAC); bacteria: artificial 
chromosomes (BAG); PI artificial chromosomes, see, e.g., Woon (1998) Gen^m 'cs 50:306- 
316; Pl-derived vectors (PACs), see, e.g.. Kern (1997) Biotechniques 23:120-124; cosmids, 
recombinant viruses, phages or plasmids. 

[03871 The nucleic acids of the invention can be operatively linked to a promoter. A 
promoter can be one motif or an array of nucleic acid control sequences which direct 
transcription of a nucleic acid. A promoter can include necessary nucleic acid sequences 
near the start site of transcription, such as, in the case of a polymerase II type promoter, a 
TATA element. A promoter also optionally includes distal enhancer or repressor elements 
which can be located as much as several thousand base pairs from the start site of 
transcription. A "constitutive" promoter is a promoter which is active under most 
environmental and developmental conditions. An "inducible" promoter is a promoter which 
is under environmental or developmental regulation, A "tissue specific" promoter is active 
in certain tissue types of an organism, but not in other tissue types fi-om the same organism. 
The term "operably linked" refers to a fimctional linkage between a nucleic acid expression 
control sequence (such as a promoter, or array of transcription factor binding sites) and a 
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second nucleic acid sequence, wherein the expression control sequence directs transcription 
of the nucleic acid corresponding to the second sequence. 

[0388] The nucleic acids of the invention can also be provided in expression vectors and 
cloning vehicles, e.g., sequences encoding the polypeptides of the invention. Expression 
vectors and cloning veliicles of the invention can comprise viral particles, baculovirus, 
phage, plasmids, phagemids, cosmids, fosmids, bacterial artificial chromosomes, viral DNA 
(e.g., vaccinia, adenovirus, foul pox virus, pseudorabies and derivatives of SV40), PI -based 
artificial chromosomes, yeast plasmids, yeast artificial chromosomes, and any other vectors 
specific for specific hosts of interest (such as bacillus, Aspergillus and yeast). Vectors of 
the invention can include chromosomal, non-chromosomal and synthetic DNA sequences. 
Large numbers of smtable vectors are known to those of skill in the art, and are 
commercially available. 

[0389] The nucleic acids of the invention can be cloned, if desired, into any of a variety of 
vectors using routine molecular biological methods; methods for cloning in vitro ampUfied 
nucleic acids are disclosed, e,g., U.S. Pat. No. 5,426,039. To facilitate cloning of amplified 
sequences, restriction enzyme sites can be "built into" a PGR primer pair. Vectors may be 
introduced into a genome or into the cytoplasm or a nucleus of a cell and expressed by a 
variety of conventional techniques, well described in the scientific and patent literature. 
See, e.g., Roberts (1987) Nature 328:73 1; Schneider (1995) Protein Expr. Purif. 6435:10; . 
Sambrook, Tijssen or Ausubel. The vectors can be isolated firom natural sources, obtained 
fix)m such sources as ATCC or GenBank libraries, or prepared by synthetic or recombinant 
methods. For example, the nucleic acids of the invention can be expressed in expression 
cassettes, vectors or viruses which are stably or transiently expressed in cells (e.g., episomal 
expression systems). Selection markers can be incorporated into expression cassettes and 
vectors to confer a selectable phenotype on transformed cells and sequences. For example, 
selection markers can code for episomal maintenance and replication such that integration 
into the host genome is not required. 

[0390] In one aspect, the nucleic acids of the invention are administered in vivo for in situ 
expression of the peptides or polypeptides of the invention. The nucleic acids can be 
administered as "naked DNA'' (see, e.g., U.S. Patent No. 5,580,859) or in the form of an 
expression vector, e.g., a recombinant vims. The nucleic acids can be administered by any 
route, including peri- or intra-tumorally, as described below. Vectors administered in vivo 
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can be derived from viral genomes, including recombinantly modified enveloped or 
non-enveloped DNA and RNA viruses, preferably selected from baculoviridiae, 
parvoviridiae, picomoviridiae, herpesveridiae, poxviridae^ adenoviridiae, or 
picomnaviridiae. Chimeric vectors may also be employed which exploit advantageous 
merits of each of the parent vector properties (See e.g., Feng (1997) Nature Biotechnology 
1 5:866-870). Such viral genomes may be modified by recombinant DNA techniques to 
include the nucleic acids of the invention; and may be further engineered to be replication 
deficient, conditionally replicating or replication competent. In alternative aspects, vectors 
are derived from the adenoviral (e.g,, replication incompetent vectors derived from the 
human adenovirus genome, see, e.g., U.S. Patent Nos. 6,096,718; 6,110,458; 6,113,913; 
5,631,236); adeno-associated viral and retroviral genomes. Retroviral vectors can include 
those based upon murine leukemia virus (MuLV), gibbon ape leukemia vims (GaLV), 
Simian hnmuno deficiency virus (SIV), human immuno deficiency vims (HIV), and 
combinations thereof; see, e.g,, U.S. Patent Nos. 6,117,681; 6,107,478; 5,658,775; 
5,449,614; Buchscher (1992) J. Virol. 66:2731-2739; Johann (1992) J. Virol 66:1635- 
1640). Adeno-associated viras (AAV)-based vectors can be used to transduce cells with 
target nucleic acids, e.g., in the in vitro production of nucleic acids and peptides, and in in 
vivo and ex vivo gene therapy procedures; see, e.g., U.S. Patent Nos. 6,1 10,456; 5,474,935; 
Okada (1996) Gene Ther, 3:957-964. 

{0391] The present invention also relates to fiision proteins, and nucleic acids encoding 
thttm, A polypeptide of the invention can be fiised to a heterologous peptide or polypeptide, 
such as N-terminal identification peptides which impart desired characteristics, such as 
increased stability or simplified purification. Peptides and polypeptides of the invention can 
also be synthesized and expressed as fusion proteins with one or more additional domains 
linked thereto for, e.g., producing a more immunogenic peptide, to more readily isolate a 
recombinantly synthesized peptide, to identify and isolate antibodies and antibody- 
expressing B cells, and the like. Detection and purification facilitating domains include, 
e.g., metal chelating peptides such as polyhistidine tracts and histidine-tryptophan modules 
that allow purification on immobilized metals, protein A domains that allow purification on 
immobilized immunoglobulin, and the domain utilized in the FLAGS extension/affinity 
purification system (Immunex Corp, Seattle WA). The inclusion of a cleavable linker 
sequences such as Factor Xa or enterokinase (Invitrogen, San Diego CA) between a 
purification domain and the motif-comprising peptide or polypeptide to facilitate 
purification. For example, an expression vector can include an epitope-encoding nucleic 



SUBSTITUTE SHEET (RULE 26) 



wo 2005/028624 



99 



PCT/US2004/030360 



acid sequence linked to six histidine residues followed by a thioredoxin and an enterokinase 
cleavage site (see e.g., V^lft@>(1995) Biochemistry 34:1787-1797; Dobeli (1998) Protein 
Expr Purif, 12:404-414). The histidine residues facilitate detection and purification while 
the enterokinase cleavage site provides a means for purifying the epitope from the 
remainder of the fusion protein. In one aspect, a nucleic acid encoding a polypeptide of the 
invention is assembled in appropriate phase with a leader sequence capable of directing 
secretion of the translated polypeptide or fragment thereof. Technology pertaining to 
vectors encoding fusion proteins and application of fusion proteins are well disclosed in the 
scientific and patent literature, see e.g., Kroll (1993) DNA Cell BioL 12:441-53, 

|0392] The nucleic acids and polypeptides of the invention can be bound to a solid 
support, e.g., for use in screening and diagnostic methods. Solid supports can include, e.g., 
membranes (e.g., nitrocellulose or nylon), a microtiter dish (e.g., PVC, polypropylene, or 
polystyrene), a test tube (glass or plastic), a dip stick (e.g.; glass, PVC, polypropylene, 
polystyrene, latex and the like), a microfuge tube, or a glass, silica, plastic, metallic or 
polymer bead or other substrate such as paper. One solid support uses a meta (e.g., cobalt 
or nickel)-comprising column which binds with specificity to a histidine tag e gineered 
onto a peptide. 

[0393] Adhesion of molecules to a solid support can be direct (i.e., the molecule conca . ts 
the solid support) or indirect (a "linker" is boimd to the support and the molecule of intexf^st 
binds to this linker). Molecules can be immobilized either covalently (e.g., utilizing single 
reactive thiol groups of cysteine residues (see, e.g., Colliuod (1993) Bioconjugate Chem. 
4:528-536) or non-covalently but specifically (e.g., via immobilized antibodies (see, e.g., 
Schuhmann (1991) ^^iv. Mater. 3:388-391; Lu {1995) Anal Chem. 67:83-87; the 
biotin/strepavidin system (see, e.g., Iwane (1997) Biophys, Biochem. Res. Comm. 230:76- 
80); metal chelating, e.g., Langmuir-Blodgett films (see, e.g., Ng (1995) Langmuir 1 1 :4048- 
55); metal-chelating self-assembled monolayers (see, e.g., Sigal (1996) Anal Chem, 
68:490-497) for binding of polyhistidine fusions, 

[0394] Lidirect binding can be achieved using a variety of linkers which are commercially 
available. The reactive ends can be any of a variety of functionalities including, but not 
Hmited to: amino reacting ends such as N-hydroxysuccinimide (NHS) active esters, 
imidoesters, aldehydes, epoxides, sulfonyl halides, isocyanate, isothiocyanate, and nitroaryl 
halides; and thiol reacting ends such as pyridyl disulfides, maleimides, thiophthalimides. 
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and active halogens. The heterobiflmctional crosslinking reagents have two different 
reactive ends, e.g., an ainino-reactive end and a thiol-reactive end, while homobifunctional 
reagents have two similar reactive ends, e.g., bismaleimidohexane (BMH) which permits 
the cross-linking of sulfliydryl-containing compounds. The spacer can be of varying length 
and be aliphatic or aromatic. Examples of commercially available homobifunctional cross- 
linking reagents include, but are not limited to, the imidoesters such as dimethyl adipimidate 
dihydrochloride (DMA); dimethyl pimelimidate dihydrochloride (DMP); and dimethyl 
suberimidate dihydrochloride (DMS). Heterobifunctional reagents include commercially 
available active halogen-NHS active esters coupling agents such as N-succinimidyl 
bromoacetate and N-succinimidyl (4-iodoacetyl)aminobenzoate (SIAB) and the 
sulfosuccinimidyl derivatives such as sulfosuccinimidyl(4-iodoacetyl)aminobenzoate (sulfo- 
SIAB) (Pierce). Another group of coupling agents is the heterobifunctional and thiol 
cleavable agents such as N-succinimidyl 3-(2-pyridyidithio)propionate (SPDP) (Pierce 
Chemicals, Rockford, XL). 

[0395] Antibodies can also be used for binding polypeptides and peptides of the invention 
to a solid support. This can be done directly by binding peptide-specific antibodies to the 
column or it can be done by creating fusion protein chimeras comprising motif-containing 
peptides linked to, e.g., a known epitope (e.g., a tag (e.g., FLAG, myc) or an appropriate 
immunoglobulin constant domain sequence (an "immunoadhesin," see,, e.g.. Capon (1989) 
Nature 377:525-531 (1989). 

[0396] Nucleic acids or polypeptides of the invention can be immobilized to or applied to 
an array. Arrays can be used to screen for or monitor libraries of compositions (e.g., small 
molecules, antibodies, nucleic acids, etc.) for their ability to bind to or modulate the activity 
of a nucleic acid or a polypeptide of the invention. For example, in one aspect of the 
invention, a monitored parameter is transcript expression of a gene comprising a nucleic 
acid of the invention. One or more, or, all the transcripts of a cell can be measured by 
hybridization of a sample comprising transcripts of the cell, or, nucleic acids representative 
of or complementary to transcripts of a cell, by hybridization to immobiUzed nucleic acids 
on an array, or "biochip.'* By using an "array" of nucleic acids on a microchip, some or all 
of the transcripts of a cell can be simultaneously quantified. Alternatively, arrays 
comprising genomic nucleic acid can also be used to determine the genotype of a newly 
engineered strain made by the methods of the invention. Polypeptide arrays" can also be 
used to simultaneouslv quantify a plurality of proteins. 
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[0397] The terms "array or "microaxray" or ''biochip" or "chip" as used herein is a 
plxirality of target elements, each target element comprising a defined amount of one or 
more polypeptides (including antibodies) or nucleic acids immobilized onto a dejBned area 
of a substrate surface. In practicing the methods of die invention, any known array and/or 
method of making and using arrays can be incorporated in whole or in part, or variations 
thereof, as disclosed, for example, in U.S. Patent Nos. 6,277,628; 6,277,489; 6,261,776; 
6,258,606; 6,054,270; 6,048,695; 6,045,996; 6,022,963; 6,013,440; 5,965,452; 5,959,098; 
5,856,174; 5,830,645; 5,770,456; 5,632,957; 5,556,752; 5,143,854; 5,807,522; 5,800,992; 
5,744,305; 5,700,637; 5,556,752; 5,434,049; see also, e.g., WO 99/51773; WO 99/09217; 
WO 97/46313; WO 96/17958; see also, e.g., Johnston (1998) Curr, Biol 8:R171-R174; 
Schummer (1997) Biotechniques 23:1087-1092; Kern (1997) Biotechniques 23:120-124; 
Solinas-Toldo (1997) Genes, Chromosomes & Career 20:399-407; Bowtell (1999) Nature 
Genetics Supp. 21:25-32. See also published U.S. patent applications Nos. 20010018642; 
20010019827; 20010016322; 20010014449; 20010014448; 20010012537; 20010008765. 

Host Cells and Transformed Cells Comprising Kinase Sequences 

[0398] As indicated above, availability of kinase coding sequences also allows provision 
of a transformed cell cornprising a kinase nucleic acid sequence, e.g., a sequence encoding a 
kinase polypq>tide, or a vector. The host cell may be any of the host cells familiar to those 
skilled in the art, including prokaryotic cells, eukaryotic cells, such as bacterial cells, fungal 
cells, yeast cells, mammalian cells, insect cells, or plant cells. Exemplary bacterial cells 
include E. coliy Streptomyces^ Bacillus subtilis^ Salmonella typhimurium and various species 
within the genera Pseudomonas, Streptomyces^ and Staphylococcus, Exemplary insect cells 
include Drosophila S2 and Spodoptera Sf9. Exemplary animal cells include CHO, COS or 
Bowes melanoma or any mouse or himian cell line. The selection of an appropriate host is 
within the abilities of those skilled in the art. 

[0399] Vectors may be introduced into the host cells using any of a variety of techniques, 
including transformation, transfection, transduction, viral infection, gene guns, or Ti- 
mediated gene transfer. Particular methods include calcium phosphate transfection, DEAE- 
Dextran mediated transfection, lipofection, or electroporation. 

[0400] Engineered host cells can be cultured in conventional nutrient media modified as 
appropriate for activating promoters, selecting transformants or amplifying the genes of the 
invention. Following transformation of a suitable host strain and growth of the host strain 
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to an appropriate cell density, the selected promoter may be induced by appropriate means 
(e.g., temperature shift or chemical induction) and the cells may be cultured for an 
additional period to allow them to produce the desired polypeptide or fragment thereof. 

[0401] Cells can be harvested by centrifugation, disrupted by physical or chemical means, 
and the resulting cmde extract is retained for further purification. Microbial cells employed 
for expression of proteins can be disrupted by any convenient method, including freeze- 
thaw cycling, sonication, mechanical disraption, or use of cell lysing agents. Such methods 
are well known to those skilled in the art. The expressed polypeptide or fragment can be 
recovered and purified from recombinant cell cultures by methods including ammonium 
sulfate or ethanol precipitation, acid extraction, anion or cation exchange chromatography, 
phosphocellulose chromatography, hydrophobic interaction chromatography, affinity 
chromatography, hydroxylapatite chromatography and lectin chromatography. Protein 
refolding steps can be used, as necessary, in completing configuration of the polypeptide. If 
desired, high performance liquid chromatography (HPLC) can be employed for final 
purification steps. 

[0402] Various mammalian cell culture systems can also be employed to express 
recombinant protein. Examples of mammalian expression systems include the COS-7 Jin^L: 
of monkey kidney fibroblasts and other cell lines capable of expressing proteins from a 
compatible vector, such as the C127, 3T3, CHO, HeLa and BHK cell lines. 

[0403] The constmcts in host cells can be used in a conventional manner to produce the 
gene product encoded by the recombinant sequence. Depending upon the host employed in 
a recombinant production procedure, the polypeptides produced by host cells containing the 
vector may be glycosylated or may be non-glycosylated. Polypeptides of the invention may 
or may not also include an initial methionine amino acid residue. 

[0404] Cell-free translation systems can also be employed to produce a polypeptide of the 
invention. Cell-free translation systems can use mRNAs transcribed from a DNA constmct 
comprising a promoter operably linked to a nucleic acid encoding the polypeptide or 
fragment thereof. In some aspects, the DNA constmct may be linearized prior to 
conducting an in vitro transcription reaction. The transcribed mRNA is then incubated with 
an appropriate cell-free translation extract, such as a rabbit reticulocyte extract, to produce 
the desired polypeptide or fragment thereof. 
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[0405] The expression vectors can contain one or more selectable marker genes to provide 
a phenotypic trait for selection of transformed host cells such as dihydrofolate reductase or 
neomycin resistance for eukaryotic cell culture, or such as tetracycline or ampicillin 
resistance in E. colL 

[0406] For transient expression in mammalian cells, cDNA encoding a polypeptide of 
interest may be incorporated into a mammalian expression vector, e.g. pcDNAI, which is 
available commercially from Invitrogen Corporation (San Diego, Calif., U.S.A.; catalogue 
number V490-20). This is a multifunctional 4.2 kb plasmid vector designed for cDNA 
expression in eukaryotic systems, and cDNA analysis in prokaryotes, incorporated on the 
vector are the CMV promoter and enhancer, splice segment and polyadenylation signal, an 
SV40 and Polyoma vims origin of replication, and Ml 3 origin to rescue single strand DNA 
for sequencing and mutagenesis, Sp6 and T7 RNA promoters for the production of sense 
and anti-sense RNA transcripts and a Col El-like high copy plasmid origin. A polylinker is 
located appropriately downstream of the CMV promoter (and 3* of the T7 promoter). 

[0407] The cDNA insert may be first released from the above phagemid incorporated at 
appropriate restriction sites in the pcDNAI polylinker. Sequencing across the junctions may 
be performed to confirm proper insert orientation in pcDNAI. The resulting plasmid may 
then be introduced for transient expression into a selected mammalian cell host, for 
example, the monkey-derived, fibroblast like cells of the COS-1 lineage (available from the 
American Type Culture Collection, Rockville, Md. as ATCC CRL 1650). 

[0408] For transient expression of the protein-encoding DNA, for example, COS-1 cells 
may be transfected with approximately 8 jxg DNA per 10^ COS cells, by DEAE-mediated 
DNA transfection and treated with chloroquine according to the procedures described by 
Sambrook et al. Molecular Cloning: A Laboratory Manual, 1989, Cold Spring Harbor 
Laboratory Press, Cold Spring Harbor N.Y, pp. 16.30-16.37, An exemplary method is as 
follows. Briefly, COS-1 cells are plated at a density of 5 x 10^ cells/dish and then grown for 
24 hours in FBS -supplemented DMEM/F12 medium. Medium is then removed and cells 
are washed in PBS and then in mediimi. A transfection solution containing DEAE dextran 
(0.4 mg/ml), 100,{xM chloroquine, 10% NuSerum, DNA (0.4 mg/ml) in DMEM/F12 
medium is then applied on the cells 10 ml volume. After incubation for 3 hours at 37 *^C, 
cells are. washed in PBS and medium as just described and then shocked for 1 minute with 
10% DMSO in DMEM/F12 medium. Cells are allowed to grow for 2-3 days in 10% FBS- 
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supplemented medium, and at the end of incubation dishes are placed on ice, washed with 
ice cold PBS and then removed by scraping. Cells are then harvested by centrifugation at 
1000 rpm for 10 minutes and the cellular pellet is frozen in liquid nitrogen, for subsequent 
use in protein expression. Northern blot analysis of a thawed aliquot of jfrozen cells may be 
used to confmn expression of receptor-encoding cDNA in cells under storage. 

[0409] In a like manner, stably transfected cell lines can also prepared, for example, using 
two different cell types as host: CHO Kl and CHO Pro5, To constmct these cell lines, 
cDNA coding for the relevant protein may be incorporated into the mammalian expression 
vector pRC/CMV (Invitrogen), which enables stable expression. Insertion at this site places 
the cDNA under the expression control of the c34omegalovirus promoter and upstream of 
the polyadenylation site and terminator of the bovine growth hormone gene, and into a 
vector background comprising the neomycin resistance gene (driven by the SV40 early 
promoter) as selectable marker. 

f0410] An exemplary protocol to introduce plasmids constmcted as described above is as 
allows. The host CHO cells are first seeded at a density of 5x10^ in 10% FBS- 
sAppl&mented MEM medium. After growth for 24 hours, firesh medium is added to the 
plates jLnd three hours later, the cells are transfected using the calcium phosphate-DNA co- 
precipiladi ,n procedure (Sambrook et al, supra). Briefly, 3 ixg of DNA is mixed and 
incubated with buffered calcium solution for 10 minutes at room temperature. An equal 
volume of i>urTered phosphate solution is added and the suspension is incubated for 15 
minutes at room temperature. Next, the incubated suspension is applied to the cells for 4 
hours, removed and cells were shocked with medium containing 15% glycerol. Three 
minutes later, cells are washed with medium and incubated for 24 hours at normal growth 
conditions. Cells resistant to neomycin are selected in 10% FBS-supplemented alpha-MEM 
medium containuig G418 (1 mg/ml). individual colonies of G418-resistant cells are 
isolated about 2-3 weeks later, clonally selected and then propagated for assay purposes. 

EXAMPLES 

EXAMPLE 1 : Cloning of PIM-1 

[04111 The PIM-1 DNA encoding amino acids 1-313 and 29- 313 were amplified from 
human brain cDNA (Clonetech) by PGR protocols and cloned into a modified pET 29 
vector (Novagen) between Ndel and Sail restriction enzyme sites. The amino acid 
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sequences of the cloned DNA were confirmed by DNA sequencing and the expressed 
proteins contain a hexa-histidine sequence at the C terminus. The protein was expressed in 
E. coli BL21(DE3)pLysS (Novagen). The bacteria were grown at 22°C in Terrific broth to 
1-1.2 OD600 and protein was induced by 1 mM IPTG for 16-18 h. The bacterial pellet was 
collected by centrifugation and stored at -VO^'C until used for protein purification. PIM-2 
and PIM-S are cloned similarly. 

EXAMPLE 2: Purification of PIM-1 

[04121 The bacterial pellet of approximately 250-300g (usually firom 16 L) expressing 
PIM-1 kinase domain (29-313) was suspended in 0.6 L of Lysis buffer (0.1 M potassium 
phosphate buffer, pH 8.0, 10 % glycerol, 1 mM PMSF) and the cells were lysed in a French 
Pressure cell at 20,000 psi. The cell extract was clarified at 17,000 rpm in a Sorval SA 600 
rotor for 1 h. The supernatant was re-centrifiiged at 17000 rpm for another extra hour. The 
clear supematant was added with imidazole (pH 8,0) to 5 mM and 2 ml of cobalt beads 
(50% slurry) to each 40 ml cell extract. The beads were mixed at 4°C for 3-4 h on a nutator. 
The cobalt beads were recovered by centrifiigation at 4000 rpm for 5 min. The pelleted 
' beads were washed several times with lysis buffer and the beads were packed on a Byroad 
disposable column. The bound protein was eluted with 3-4 column volumes of 0.1 M 
imidazole followed by 0.25 M imidazole prepared m lysis buffer. The eluted protein was 
analyzed by SDS gel electrophoresis for purity and yield. 

[0413] The eluted protein from cobalt beads was concentrated by Centriprep-1 0 (Amicon) 
and separated on Pharmacia Superdex 200 column (16/60) in low salt buffer (25 mM Tris- 
HCl, pH 8.0, 150 mM NaCl, 14 mM beta mercaptoethanol). The peak fractions containing 
PIM-1 kinase was fiulher purified on a Pharmacia Source Q column (10/10) in 20 mM Tris- 
HCl pH 7.5 and 14 mM beta mercaptoethanol using a NaCl gradient in an AKTA-FPLC 
(Pharmacia). The PIM-1 kinase eluted approximately at 0.2 M NaCl gradient. The peak 
fractions were analyzed by SDS gel electrophoresis and were pooled and concentrated by 
Centriprep 10. The concentrated PIM-1 protein (usually 50-60 A280/ml) was aliquoted into 
many tubes (60ul), flash frozen in Uquid nitrogen and stored at -70^C until used for 
crystallization. The frozen PIM-1 kmase still retained kinase activity as concluded from 
activity assays. PIM-2 and PIM-3 can be purified in the same way with small adjustments 
to conditions, e.g,^ elution conditions. 
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Example 3: Variants and Derivatives of PIM-1 

[0414] In mouse, PIM-1 is expressed as two forms of 44 kDa and 33 kDa. The p44 kDa 
PIM-1 is encoded by the same gene as p33 kDa PIM-1 but the translation is initiated at an 
upstream CUG codon (Saris CJ, Domen J, and Bems A. (1991) The PIM-1 oncogene 
encodes two related protein-serine/threonine kinases by altemative initiation at AUG and 
CUG. EMBO J. 10: 655-664.) This results in expression of p44 PIM-1 having a unique 1 1 
kDa N terminal extension that is followed by the p33 PlM-1 sequence. The p33 kDa PIM-1 
contains almost the entire kinase domain and both p33 and p44 kDa have comparable kinase 
activity and both can prevent apoptosis (Lilly M, Sandholm J, Cooper JJ, Koskinen PJ, and 
Kraft A. (1999) The PIM-1 serine kinase prolongs survival and inhibits apoptosis-related 
mitochondrial dysfunction in part through a bcl-2-dependent pathway. Oncogene., 18: 4022- 
4031). CD40 engagement caused significant increase in the levels of both 33 and 44 kDa 
forms of PIMl in cytoplasmic extracts of WEHI-231 cells (Zhu N, Ramirez LM, Lee RL, 
Magnuson NS, Bishop GA, and Gold MR.(2002) CD40 signaling in B cells regulates the 
expression of the PlM-1 kinase via the NF-kappa B pathway. J Immunol. 168: 744-754). 
Recently it has been shown that the p33kDa form was more strongly associated with Sees- 1 
than the p44 kDa form (Chen XP, Losman JA, Cowan S, Donahue E, Fay S, Vuong BQ, 
Nawijn MC, Capece D, Cohan VL, Rothman P, (2002) PIM serine/threonine kinases 
regulate the stability of Socs-1 protein. Proc Natl Acad Sci U S A., 99:2175-2180). 

[0415] There are no reports of PIM-1 existing in more than one form in human. Analysis 
of PIM-1 gene sequence reveals that the presence of in-jS:ame stop codons block synthesis 
of proteins with N terminal extensions. However, the human PIM-2 gene contains no in- 
frame stop codon, based on the reported DNA sequence. Therefore, alternate initiation at an 
upstream start codon is possible. We have expressed the PIM-2 kinase domain in E.coli and 
purified the protein by the same methods as described for PIM-1 kinase. 

Example 4: Crystallization of PIM-1. 

PIM'I protein crystal growth: 

[0416] All materials were purchased through Hampton Research, Inc. (Laguna Niguel, 
CA) unless otherwise noted.PlM-1 protein @ 7 and 14 mg/ml was screened against 
Hampton Crystal Screen 1 and 2 kits (HSl and HS2) and yielded successful crystals 
growing in at least 10 conditions from HSl alone. Crystals were grown initially using- 

SUBSTITUTE SHEET (RULE 26) 



wo 2005/028624 



107 



PCT/US2004/030360 



sitting drops against the Hampton screening conditions set in Greiner 96 well CrystalQuick 
crystallization plates with 100 ul reservoir and 1 ul protein -f 1 ul reservoir added per 
platform (1 of 3 available). Conditions from Hampton Screen 1 yielded obvious protein 
crystals in conditions: #2,7,14,17,23,25,29,36,44,and 49. These crystals were grown at 
4*^0, and grew in size to varying dimensions, all hexagonal rod shaped and hardy. 

[04171 Crystals of larger dimensions, 100 iiM wide x 400 uM long, were then grown in 
larger drop volumes and in larger dimension plates. Refined grids were performed with 
both hanging and sitting drop methods in VDX plates (cat. # HR3~ 140) or CrysChem plates 
(cat. # HR3-160). There appeared to be no obvious difference of crystal size or quality 
between the two methods, but there was a preference to use hanging drops to facilitate 
mounting procedures. 

[0418] We proceeded with refining conditions by gridding 4 independent reservoir 
conditions initially obtained firom the screening kits. 

1) HSl # 17 was optimized to 0.2 M LiCl, 0.1 M Tris pH 8.5 and 5%- 15% 

Polyethylene glycol 4000; 

2) HSl # 25 was optimized to 0.4 M - 0.9 M Sodium Acetate trihydrate pH 6.5 and 

0.1 M Imidazole; 

3) HSl # 29 was optimized to 0.2M ~ 0.7 M Sodium Potassium tartrate and 0.1 M 

MES buffer pH 6.5; 

4) HSl # 44 was optimized to 0.25 M Magnesium formate. 

[0419] These optimized conditions produced crystals with the most consistent size and 
quality of appearance. Conditions were further evaluated by x-ray diffraction analysis of 
the resulting protein crystals, and keeping in mind the utility for forming compound co- 
crystals in these conditions as well (ie. salt composition and concentration effects are 
important to develop suitable compound solubility in the crystallization experiments). 
Native crystals grew as rods in many drops to large dimensions of approximately 100 urn 
wide and 500 um long. 

Seleno Methionine labeled PIM-1 protein crystal growth. 

[0420] Se-Met labeled PIM protein was expressed and purified as described by 
Hendrickson, W. A., and Ogata, C. M, (1997) "Phase determination from multiwavelength 
anomalous diffraction measurements, Methods Enzymol^ 276, 494-523, and Hendrickson, 
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W. A., Horton, J. R., and LeMaster, D. M. (1990) "Selenomethionyl proteins produced for 
analysis by multiwavelength anorr ulcus diffraction (MAD): a vehicle for direct 
determination of three-dimentional structure, EMBOJ,, 9, 1665-1672. This preparation 
appeared to be less soluble as evidenced by more pronounced nucleation within the screen 
drops and due to the hydrophobic nature of Se labeled proteins. Crystals grew small and in 
showers compared to the previously evaluated similar drop conditions that the native 
protein grew well m. Upon finer griddhig, 20 ^m wide x 100 |im long crystals were 
obtained in condition HSl # 17 optimized at 0.2 M LiCl, 0,1 M Tris pH 8.5 and 5% -15% 
PEG 4000. These crystals and all others were carefully mounted in 50-100 uM nylon 
loops on copper stem magnetic bases that were flash firozen in Hquid nitrogen in appropriate 
cryogenic buffer and taken to the Lawerence Berkeley Lab synchrotron, the Advanced Light 
Source (ALS) beamline 8.3.1. 

PIM'l protein/Molecular Scaffolds Co-crystal growth: 

[0421] In order to add compounds to PIM-1 protein, compoimds were added directly from 
their DMSO stocks (20-200 mM) into the protein solution at high concentration. The 
procedure involved adding the DMSO stocks containing compound as a thin layer to the 
wall of the 1 .5 ml eppendorf tube that contains the protein. The solution was then gently 
rolled over the wall of the tube until the compound was in the protein solution. The final 
concentration of compounds in the PIM-1 solution usually achieved was between 0.5 and 1 
mM with DMSO concentrations less than 2% being added. The solutions were then set-up 
in trays immediately as previously described. 

PIM"! /Compound Co-crystal Screening in HSl: 

[0422] Two conditions for crystal growth have resulted in the best results with PIM-1 
protein and added compounds. The optimized Na-K tartrate and Na-acetate tetrahydrate 
solutions listed above. Crystals varied greatly in size but data has been collected on various 
crystals that are between 20 uM and 100 uM in width. These crystals were typically several 
hundred microns long and some required manipulation as well as being broken to facilitate 
mounting procedures into loops. Interestingly, some crystals that were grown in the 
presence of colored compounds were also colored the same way. 
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Examples: Diffraction^^^^g^is of PIM-1. 

[0423] Crystals were first determined to diffract on a Rigaku RU-200 rotating copper 
anode x-ray source equipped with Yale focusing optics and an R-AXIS 2C imaging plate 
system. A crystal grown in the optimized condition HSl # 17 (DY plate 12/14/01) was used 
to conduct initial diffraction experiments. 

[0424] After x-ray diffraction was initially determined as described above, large native 
protein crystals grown in Mg-Fomiate (DY plate) and were frozen in cryoprotectant by 
submersion in liquid nitrogen and then tested for diffraction at ALS beamline 83,1. Data 
was originally collected, indexed and reduced using Mosflm. The spacegroup was 
determined to be P65. 

[0425] We have collected 3 native data sets, the highest resolution obtained with good 
statistics after merging is to 2.0 angstroms, 

[0426; We have collected a MAD data set on the Se-Met labeled PIM-1 crystal using the 
experii lentally determined 12668 eV peak and 1 1000 eV remote for selenium to 3.2 
angstronis. Subsequently a 2.6 angstrom Se peak data set was collected at the 
experimentaU> d Jtermined peak of 12668 eV radiation. 

[0427] We ha^ e collected more than 50 PIM-l/binding compound co-crystal data sets. 
All data was indexed and reduced as indicated in the computational . crystallographic work 
that follows. 

PIM-1 Structure Determination and Refinement 

Data set: Native, Resolution: 2.13 A 

[0428] The primary structure determination was carried out using Molecular Replacement 
method with programs 

EPMR (Public domain) 

AmoRe (from CCP4)) 

And a homology model of PIM-1 based on the protein Phosphorylase Kinase 
(PDB ID: IPHK- Owen et aL, 1995, Structure 3:467) 
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[0429] The molecular replacement was carried out in all of the P6 space groups (P61, P62, 
. . . P65). The best solution was obtained in P65. 

[0430] The molecular replacement solution was improved by several rounds of the cycles 
of 

Model Building in O (from DatOno AB) 
Annealing in CNX (from Accelerys) 

SigmaA weighting and Solvent Flattening the resultant map with DM (from 
CCP4) 

[0431] The statistics at the end of these cycles were R ~ 36 %. 

Data set: SeMet (2 wavelengths). Resolution: 3.3 A 

[0432] The MAD phased data (with SOLVE (from Los Alamos National Laboratory)) 
helped improve the model in the refinement with REFMAC (from CCP4). 

Data set: SeMet (1 wavelength). Resolution: 2.6 A 

[0433] Further improvement of the model was obtained using SAD Phasing with SOLVE 
and subsequent improvement with RESOLVE produced an excellent map into which the 
PIMl model could be rebuilt completely. 

[04341 The newly built model refined with CTslX/Anneal and then with CCP4/Refimac to 
give R = 27.7% and Rfree = 3 L9 % 

Data set: Native, Resolution: 2.1 A 

[0435] The above model has been ftirther refined against the native data with 
CCP4/Refinac, giving R = 22. 1 %, Rfree = 24.2 %. 

Example 6: Co-Crystal Structures 

[0436] Exemplary co-crystal stmctures have been determined for 7 compounds with PIM- 
1, using methods as generally described above. Those co-crystals are the following (the 
number indicates the compound id and the compoimd source is provided in parentheses): 
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PIM1_5321980 (Chembridge) 
PIM1_RB00137 (Maybridge) 
PIM1_5264241 (Chembridge) 
PIM1_RJF00907 (Maybridge) 
P]M1_5 140994 (Chembridge) 
P]M1_5 108305 (Chembridge) 
PIM1_BTB02713 (Maybridge) 

Example 7: PIM Binding Assays 

[0437] Such binding assays can be performed in a variety of ways, including a variety of 
ways known in the art. For example, competitive binding to PIM-1 can be measured on 
Nickel-FlashPlates, using His-tagged PIM-1 100 ng) and ATPyC^^S] 10 nCi). As 
compound is added, the signal decreases, since less ATPy[^^S] is bound to PIMl which is 
proximal to the scintillant in the FlashPlate. The binding assay can be performed by the 
addition of compoimd (10 20 mM) to PIM-1 protein (90 10 p.1) followed by the addition 
of ATPy[^^S] and incubating for 1 hr at 37°C- The radioactivity is measured through 
scintillation counting in Trilus (Perkin-Elmer). 

[0438] Alternatively, any method which can measure binding of a ligand to the ATP- 
binding site can be used. For example, a fluorescent ligand can be used. When bound to 
PIMl, the emitted fluorescence is polarized. Once displaced by inhibitor binding, the 
polarization decreases. 

[0439] Determination of ICS 0 for compounds by competitive binding assays. (Note that 
Ki is the dissociation constant for inhibitor binding; Kd is the dissociation constant for 
substrate binding.) For this system, the IC50, inhibitor binding constant and substrate 
binding constant can be interrelated according to the following formula: 

[0440] When using radiolabeled substrate Ki = IC50 

1+ [L*]/Kd 

the IC50 ~ Ki when there is a small amount of labeled substrate. 
Exan^ple 8: PIM Activity Assays 

[0441] Inhibitory or exhitory activity of compounds binding to PIM-1 was determined 
using the kinase activity assay described in the detailed description. 



SUBSTITUTE SHEET (RULE 26) 



wo 2005/028624 



112 



PCT/US2004/030360 



[0442] Exemplary compounds within Fomiula I, Fomiula II, and Formula in were 
assayed for inhibitory activity with PIM-L The ability to develop ligands is illustrated by 2 
compounds from the quinolinone molecular scaffold group (Formula III). A compound 
with Rl, R2, R3, R4, R5, and R6 = H, had 100% inhibition of PIM-1 at 200 |liM 
concentration, while a compound with Rl = phenyl group, R2, R3, R5, and R7 = H, and R4 
= OCF3, had only 3% inhibition of PIM-1 at 200 nM. 

Example 9: Synthesis of the Compounds of Formula I: 



R3 




Formula I 



1. Synthesis of the Compounds of Formula I - Scheme 1 




Step 2 



(1) (2) (3) 




[0443] The 2-arylbenzimidazoles derivatives, represented by Formula I, can be prepared 
as shown in Scheme 1. 
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Step 1 Preparation of formula (3) 

[0444] Compound of formula (3) was prepared by a reaction of a compound of formula 
(1) with an alkyl halide of formula (2)( e.g. X = Br) in an inert solvent (e.g. DMF), in the 
presence of a base (e.g. NaH) and heated near 100 for 24-36 h. Compoxmd of formula 
(3) was purified by column chromatography. 

Step 2 Preparation of formula (4) 

[0445] Compound of formula (4) was prepared by a reaction of a compound of formula 

(3) with a reducing agent (e.g. SnCla) in a polar solvent (e.g. EtOH)and heated near 100 ^C 
for 5-12 h. When the reaction is completed, the product of formula (4) is isolated by 
conventional means (e.g. aqueous base workup). 

Step 3 Preparation of formula J 

[0446] Compound of formula I was prepared by the reaction of the compound of formula 

(4) with an imidate of formula (5) in a polar solvent (e.g. EtOH) and heated near SO'^C for 
12-18 h. Compound of formula I was purified by colunm chromatography. 



2. Alternate synthesis of the Compounds of Formula I - Scheme 2 




(4) (6) Formula I 



Step I Preparation of formula I 

[0447] Compound of formula I was prepared by the reaction of the compound of formula 
(4) with an aldehyde of formula (6) in a polar solvent (e.g. EtOH) and heated near SO^'C for 
12-22 h. Compound of formula I was purified by column chromatography. 

3. Alternate synthesis of the Compounds of Formula I - Scheme 3 
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Step 1 Preparation of formula I 

[0448] Compound of formula I was prepared by the reaction of the compound of formula 
(4) with an aldehyde of formula (7) in an inorganic acid (e.g. polyphosphoric acid) and 
heating near 190-220 °C for 5-12 h. Compound of formula I was purified by column 
chromatography. 



Example 10: Synthesis of the Compounds of Formula H: 

a— b 

Formula II 

[0449] Synthesis schemes for exemplary groups of compounds within Formula II are 
provided below. Persons skilled in chemical synthesis will readily understand how to 
synthesize additional compounds within Formula 11. 

1. Synthesis of the Compounds of Formula Ila (where a and b are nitrogen, d 
is sulfur, and c is CR") - Scheme 4 




(8) (10) 

Step 2 ^^^^y^^^R^ 
" N-N 

Formula Ila 
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[0450] The thiadiazole derivative, represented by formula Ila, can be prepared as shown 
in Scheme 4. 

Step-l Preparation of formula (10) 

[0451] Compound of formula (1 0) was prepared by reaction of a compound of formula (8) 
(e.g. m-toluic hydrazide) with an isothiocyanate of formula (9), in a basic solvent (e.g. 
pyridine), typically heated near 65 ""C for 2-6 hours. 

Step'2 Preparation of formula JIa 

[0452] Compound of formula Ha was prepared by dissolving a compound of formula (10) 
in POCI3 and heated near 80 °C for 2 -4 hours. When the reaction is substantially complete, 
the product of formula Ha is isolated by conventional means (e.g. reverse phase HPLC). 

2. Synthesis of the Compounds of :^f>rmula lib (where a and b are nitrogen, d 
is oxygen, and c is CR") - Scheme 5 




Formula lib 



[0453] The oxadiazole derivatives, represented by formula lib, can be prepared as shown 
in Scheme 5. 

Step'l Preparation of formula (10) 

[04541 Compound of fomiula (10) was prepared by reaction of a compound of formula 
(8), e.g. m-toluic hydrazide, with an isothiocyanate of formula (9), in a basic solvent (e.g. 
pyridine), typically heated near 65 ""C for 2-6 hours. 
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Step-2 Preparation of formula lib 

[0455] Compound of fonnula lib was prepared by dissolving a componnd of formula (10) 
in SOCI2 and heated near 80 °C for 2 -4 hours. When the reaction is substantially complete, 
the product of formula n was isolated by conventional means (e.g. reverse phase HPLC). 



Example 1 1 : Synthesis of the Compounds of Formula ffla, where n = 0 : 




Formula Ilia 

1. Synthesis of the Compounds of Formula Ilia - Scheme 6 




Formula lllaa Formula lllab 

where the squiggly lines indicate a mixture of stereoisomers. 



[0456] The Pyridazinone derivatives, represented by formula Ilia, can be prepared as 
shown in Scheme 6. 
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Step-] Preparation of formula (12) 



[0457] The compound of formula (12) was prepared by reaction of a 4-oxo-butyric acid 
derivative of formula (11) with hydrazine, in a basic solvent (e.g. pyridine), typically heated 
near 65 °C for 2-6 hours. 



[0458] The compound of formula Illaa was prepared by dissolving a compound of 
formula (12) in an inert solvent and adding an oxidizing agent (e.g. Bra, chloranil, or Pd(C) 
under an air atmosphere. When the reaction is substantially complete, the product of 
formula IHaa is isolated by conventional means (e.g. reverse phase HPLC). 



[0459] Compounds of Formula Illaa (X = 0) can be converted into compounds where X = 
S, by treatment with Lawesson's reagent or P2S5 stirred in an inert solvent at ambient 
temperature for 2-6 hours. When the reaction is substantially complete, the product of 
formula Illb is isolated by conventional means (e.g. reverse phase HPLC). 

[0460] Synthesis of the compounds of Formula Illb, where n = 1: 



Step-2 Preparation of formula Illaa 



Step-S Preparation of Formula Illab 




Formula lilb 
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2. Synthesis of Compounds of Formula Illb (Illba and Ilibb) - Scheme 7 




Formula llibb 

[0461] The Pyridazinone derivatives, represented by formula Illba, can be prepared as 
shown in Scheme 7. 

Step'l Preparation of formula Illba 

[0462] The compound of formula Illba was prepared by reaction of an acyl-acrylic acid of 
formula (13) with hydrazine, in a basic solvent (e.g. aqueous NaOH), typically stirred at 
ambient temperature for 2-6 hours. When the reaction is substantially complete, the product 
of formula Ilia (X = 0) is isolated by conventional means (e.g. reverse phase HPLC). 

Step'2 Preparation of Formula Illbb 

[0463] Compounds of Formula Illba (X == 0) can be converted into compounds where X = 
S, by treatment with Lawesson's reagent or P2S5 stirred in an inert solvent at ambient 
temperature for 2-6 hours. When the reaction is substantially complete, the product of 
formula Illbb is isolated by conventional means (e.g. reverse phase HPLC). 
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Example 12: Synthesis of the Compounds of Formula IV: 

OH 




Formula IV 



1. Synthesis of the Compounds of Formula IVa, where IS? is Methyl - Scheme 8 



OH 




OH 



Step1 



N SH 
(14) 




N 



OH 



(15) 



(16) 



Step 3 



(17) 



(18) 



Steps 



OH 



N' 



.Bn 



o o 



Bn 



Step 



(19) 



(20) 



Formula IVa 



[0464] The 4-hydroxYpyrimidine derivatives, represented by formula IVa, can be prepared 
as shown in Scheme 8, 



Step'l Preparation of formula (15) 

[0465] Compound (1 5) was prepared by reaction of compound (14) with benzyl bromide 
in an inert solvent (e.g. DMF), in the presence of a base (e.g. EtaN), at room temperature for 
4 hours. Product (15) is purified by a conventional way (e.g. recrystallization). 
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Step-l Preparation of formula (16) 

[0466] Compoimd (16) was prepared by reaction of compound (1 5) with N- 
iodosuccinimide in chloroform under reflux for several hours. Product (16) was purified by 
a conventional way (e.g. recrystallization). 

Step-3 Preparation of formula (17) 

[0467] Compound (17) was prepared by reaction of compound (16) with benzyl bromide 
in an inert solvent (e.g. DMF), in the presence of a base (e.g. NaH) at room temperature for 
several hours. 

Step4 Preparation of formula (18) 

[0468] The compound of formula (1 8) was prepared by reaction of compound (17), with a 
suitable reagent for coupUng reaction v^.g. phenylboronic acid) in a suitable mixture of 
solvent (e.g. dimethoxyethane and W£ e ), ir?, the presence of a base (e.g. K2CO3), typically 
heated to 100 for several hours. The pr oduct was isolated by a conventional way (e.g. 
flash chromatography). 

Step5 Preparation of formula (19^ 

[0469] Th& compound of formula (19) was prepared by reaction of a compound of 
formula (18), with an oxidizing reagent (e.g. MCPBA) in a suitable solvent (e.g. CH2CI2), 
typically at room temperature for a few hours. 

Step6 Preparation of formula (20) 

[0470] The compound of fomiula (20) was prepared by reaction of a compound of 
formula (19) with a nucleophilic reagent (e.g. piperazine), in the presence of a base (e.g. 
CS2CO3) in a suitable solvent (e.g. dioxane) reflux for several hours. The product is purified 
in a conventional way (e.g. flash chromatography). 

Step? Preparation of formula IV 

[0471] The compound of formula IVa was prepared conventionally by hydrogenation of a 
compoimd of formula (20) in the presence of a catalyst (e.g. Pd(OH)2/C), under hydrogen in 
a suitable solvent (e.g. methanol) at room temperature for several hours. 
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2. Synthesisof the Compounds Of Formula IVb- Scheme 9 




[0472] The 4-hydroxypyrimadine derivatives, represented by formula IV, can be prepared 
as shown in Scheme 9. 

Stepl Preparation of formula IVb 

[0473] The compound of formula IVb was prepared by reaction of a compound of formula 
(21) with an electrophilic reagent (phenyl isocyanate), in the presence of a base (e.g. EtsN) 
in a suitable solvent (e.g. CH2CI2) at room temperature for several hours. When the reaction 
is substantially complete, the product of formula IVb was isolated by conventional means 
(e.g. flash chromatography). 

3: Synthesis of the Compounds of Formula IVc - Scheme 10 




[0474] The 4-hydroxypyrimidine derivatives, represented by formula IVc, were prepared 
as shown in Scheme 10. 
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Step'l Preparation of formula (23) 

[04751 The compound of formula (23) was prepared by reaction of a compound of 
formula (22), with an alkyl halide reagent (e.g. methyl iodide) in an inert solvent (e.g. 
DMF), in the presence of a base (e.g. K2CO3), typically heated near 80 °C for 12-36 hours. 

Step-2 Preparation of formula TVc 

[0476] The compound of formula (25) was prepared y by reaction of a compound of 
formula (23) with an alkylisothiourea (e.g. S-methylisothiourea, H2NCNHSCH3), while 
heating in a suitable solvent (e.g. ethanol) at 75 for several hours. When the reaction is 
substantially complete, the product of formula IVc was isolated by conventional means; for 
example, recrystallization. 

4. Alternate synthesis of the Compounds of Formula IV - Scheme 11 




Step 2 



Step'l Preparation of formula (28) 

[0477] Compound of formula (28) was prepared conventionally by reaction of a 
compound of formula (26), with thiourea of formula (27), in a polar solvent (e.g. ethanol), 
and typically heated near 80 for 12-36 hours and isolating the product by column 
chromatography. 
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Step''2 Preparation of formula (29) 

[0478] Compound of formula (29) was prepared by reaction of a compound of formula 
(28) with an alkylating agent (e.g. ethylbromoacetate), in the presence of a base (e.g. 
K2CO3) in a suitable solvent (e.g. acetonitrile) at room temperature or reflux for several 
hours. When the reaction is substantially complete, the product of formula IVc was isolated 
by conventional means. 

Example 13: Synthesis of the Compounds of Formula V: 





r3 


r2 












R^ 



Formula V 



1. Synthesis of the Compounds of Formula Va, where R^, R^, and R^ are 
Methyl; R** and R^are Hydrogen - Scheme 12 




[0479] The isoindole derivatives, represented by formula Va, can be prepared as shown in 
Scheme 12. 
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Step-1 Preparation of Formula Va 

[0480] The compound of formula (30) can be reacted with diketone compounds of 
formula (31), in aqueous acetic acid and typically heated near 100 °C for 12-36 hours. The 
compound can be isolated by conventional methods (e.g. recrystalUzation). 

2. Synthesis of the Compounds of Formula Vb, where R^, R^, R"* and R^are 
Hydrogen; R^ is hydroxy - Scheme 13 




(32) (33) Fomiula Vb 



Step-1 Preparation of Formula Vb 

[0481] The compound of Formula Vb is prepared conventionally by reaction of a 
compound of formula (32) with a Grignard reagent (e.g. Phenyl magnesium bromide), in a 
suitable solvent (e.g. benzene) and refluxed for 1 hour. When tihie reaction is substantially 
complete, the product of is isolated by conventional means (e.g., column chromatography). 

3. Synthesis of the Compounds of Formula Vc, where where R^ R^ R^, R^ R^ 
and R^ are Hydrogen - Scheme 14 





N3 

(34) (35) Formula Vc 



Step-1 Preparation ofVc 

[0482] The compound (34) and the ammonium tetrathiomolybdate of formula (35) are 
reacted at room temperature with aqueous CH3CN as the solvent; When the reaction is 
substantially complete (typically 1 hour), the solvent is removes by evaporation at reduced 
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pressure. The product Vc is isolated by conventional means (e.g. ether extraction). 
(Ramesha, et. ah, J. Org, Chem., 1995,60, 7682-7683; and references therein). 

Example 14: Synthesis of the Compounds of Formula VI: 




Formula VI 



1. Synthesis of the Compounds of Formula Via, where R"* = H - Scheme 15 




[04831 Pyrazolidinone derivatives, repro iented by formula VI, can be prepared as shown 
inScheme-15. 

Step-1 Preparation of formula (37) 

[0484] The compound of formula (37) is prepared conventionally by reaction of a 
compound of formula (36), with an alkyl halide reagent (e.g. methyl iodide) in an inert 
solvent (e.g. DMF), in the presence of a base (e.g. K2CO3), typically heated near 80 ""C for 
12-36 hours. 

Step''2 Preparation of formula Via 

[0485] Compoimd of formula (37) is reacted with hydrazine of formula (38) along with a 
catalytic amount of concentrated acid (e.g, hydrochloric acid) at 120 °C. The liberation of 
alcohol and water results in the formation of the product of formula Via, can be isolated 
through conventional means, for example column chromatography. 
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2. Alternate synthesis of the Compounds of Formula VI, where R'^^H: - 
Scheme 16 



O 

Formuia Via 

Step-1 Preparation of formula VIb 

[0486] Compoiuid of formula (36) is reacted with hydrazine of formula (38) along with a 
catalytic amount of concentrated acid (e.g. hydrochloric acid) at 120 °C. The liberation of 
alcohol and water results in the fomiation of the product of formula VIb, can be isolated 
through conventional means, for example column chromatography. 

Step'2 Preparation of formula Via: 

10487] The compound of formula VI is prepared conventionally by reaction of a 
compound of formula (42) with an alkylating agent of formula (43) (e.g. 
ethylbromoacetate), in the presence of a base (e.g. K2CO3) in a suitable solvent (e.g. 
acetonitrile) at room temperature or reflux for several hours. When the reaction is 
substantially complete, the product of Formula VI is isolated by conventional means; for 
example, recrystallization. 



O 



A 



(36) (38) Formula VIb 




Example 15: Synthesis of the Compounds of Formula VII: 
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Formula VII 



1. Synthesis of the Compounds of Formula VII - Scheme 17 




[0488] The 8-aminoquinolme derivatives, represented by formula VII, can be prepared as 
shown in Scheme- 17. In tliis particular scheme, R^, R^, and R^ are hydrogen. 

Step-l Preparation of formula (41) 

[0489] The compound of formula (41) can be prepared via a Skraup reaction by reaction 
of a compound of formula (39) (e.g. 2-mtroaniline), with glycerol (40), in the presence of 
sulfuric acid in an inert solvent (e,g. dioxane) and typically heated to > 100 for 2-24 
hours. The addition of nitrobenzene or arsenic oxide can aid the reaction (Claus and 
Schoeller, J. Prakt, Chem. 1893, 48, 140. Mosher, H.S. eU al. Org, Syn, CV 3, 568). 

Step'2 Preparation of formula (42) 

[0490] The compound of formula (42) can be prepared conventionally by reaction of a 
compound of formula (41) with a reducing agent (e.g. hydrogen gas, ammonium formate, 
HCO2NH4), in the presence of a catalyst (e.g. Pd/C), in a suitable solvent (e.g. methanol) at 
room temperature for several hours. When the reaction is substantially complete, the 
product of formula (41) can be isolated by conventional means, (e.g. filtration through 
Celite). 
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Step-3 Preparation of formula VII 

[0491] The compound of formula (42) can be reacted with a compound of formula (43) 
where X is a leaving group (e.g. bromide, chloride) or an electrophilic substituent (e.g. 
isocyanate, isothiocyanateX in the presence of base (e.g. K2CO3), in an inert solvent (e.g. 
DMF). Representaive examples of compounds of formula (43) include benzoyl chloride, 
benzenesulfonyl chloride, 3-bromo-2-methylpropane, benzyl bromide, phenyl isocyanate, 
and phenyl isothiocyanate. When the reaction is substantially complete, the product of 
formula VII can be isolated by conventional means (e.g. silica gel chromatography). 

2. Alternative Exemplary Synthesis of Compounds of Formula VII - Scheme 18 




Fomiula VH 



[0492] The 8-aminoquinoline derivatives, represented by formula VII, can be prepared as 
shown in Scheme-18. 

Step'l Preparation of formula (41) 

[0493] The compound of fonnula (41) can be prepared via a Friedlander synthesis by 
reaction of a compound of formula (44) (e.g. 2-amino-3-nitro-acetophenone), with a 
compound of formula (45) (e.g. acetone), in the presence of base (e.g. potassium hydroxide, 
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piperidine) in an inert solvent (e.g. ethanol) and possible with heating for 2-24 hours 
(Eckert, K. Angew, Chem. Int. Ed. 1981, 20, 208). 

Step-2 Preparation of formula (42) 

[0494] The compound of formula (42) can be prepared conventionally by reaction of a 
compound of formula (41) with a reducing agent (e.g. hydrogen gas, ammonium formate, 
HCO2NH4), in the presence of a catalyst (e.g. Pd/C), in a suitable solvent (e.g. methanol) at 
room temperature for several hours. When the reaction is substantially complete, the 
product of formula (42) can be isolated by conventional means, (e.g. filtration through 
Celite). 

Steps Preparation of formula VJI 

[0495] The compound of formula (42) can be reacted with a compound of formula (48) 
where X is a leaving group (e.g. bromide, chloride) or an electrophilic substituent (e.g. 
isocyanate, isothiocyanate), in the presence of base (e.g. K2CO3), in an inert solvent (e.g. 
DMF). Representaive examples of compounds of formula (43) include benzoyl chloride, 
benzenesulfonyl chloride, 3-bromo-2-methylpropane, benzyl bromide, phenyl isocyanate, 
and phenyl isothiocyanate. When the reaction is substantially complete, the product of 
formula VII can be isolated by conventional means (e.g. silica gel chromatography). 

Example 16: Site-directed Mutagenesis of Kinases 

[0496] Mutagenesis of kinases, e.g. PIM kinases, such as the P123M mutation of PIM-I 
can be carried out according to the following procedure (or other procedures available 
persons performing molecular biological techniques) as described in Molecular Biology: 
Current Innovations and Future Trends. Eds. A.M. Griffin and H.G. Griffin. (1995) ISBN 1- 
898486-01-8, Horizon Scientific Press, PO Box 1, Wymondham, Norfolk, U.K., among 
others. 

[0497] In vitro site-directed mutagenesis is an invaluable technique for studying protein 
structure-function relationships, gene expression and vector modification. Several methods 
have appeared in the literature, but many of these methods require single-stranded DNA as 
the template. The reason for this, historically, has been the need for separating the 
complementary strands to prevent reannealing. Use of PGR in site-directed mutagenesis 
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accomplishes strand separation by using a denaturing step to separate the complementing 
strands and allowing efficient polymerization of the PGR primers. PGR site-directed 
methods thus allow site-specific mutations to be incorporated in virtually any double- 
stranded plasmid; eliminating the need for M13-based vectors or single-stranded rescue. 

[0498] It is often desirable to reduce the number of cycles during PGR when performing 
PCR-based site-directed mutagenesis to prevent clonal expansion of any (undesired) 
second-site mutations. Limited cycling which would result in reduced product yield, is 
offset by increasing the starting template concentration. A selection is used to reduce the 
number of parental molecules coming through the reaction. Also, in order to use a single 
PGR primer set, it is desirable to optimize the long PGR method. Further, because of the 
extendase activity of some thermostable polymerases it is often necessary to incorporate an 
end-polishing step into the procedure prior to end-to-end ligation of the PCR-generated 
product containing the incorporated mutations in one or both PGR primers. 

[0499] The following xoUocol provids as a facile method for site-directed mutagenesis 
and accomplishes the abo\ desired features by the incorporation of the following steps: (i) 
increasing template concentrallor approximately 1000-fold over conventional PGR 
conditions; (ii) reducing the numl rr of cycles from 25-30 to 5-10; (iii) adding the restriction 
endonuclease Dpnl (recognition t3;get sequence: 5-Gm6ATG-3, where the A residue is 
methylated) to select against parental DNA (note: DNA isolated fi-om almost all common 
strains ofE, coli is i)am-methylated at the sequence 5-GATC-3); (iv) using Taq Extender in 
the PGR mix for increased reliability for PGR to 10 kb; (v) using Pfu DNA polymerase to 
polish the ends of the PGR product, and (vi) efficient intramolecular ligation in the presence 
of T4 DNA Hgase. 

[0500] Plasmid template DNA (approximately 0.5 pmole) is added to a PGR cocktail 
containing, in 25 ul of Ix mutagenesis buffer: (20 mM Tris HGl, pH 7.5; 8 mM MgG12; 40 
ug/ml BSA); 12-20 pmole of each primer (one of which must contain a 5 -prime phosphate), 
250 uM each dNTP, 2.5 U Taq DNA polymerase, 2.5 U of Taq Extender (Stratagene). 

[0501] The PGR cycling parameters are 1 cycle of: 4 min at 94 G, 2 min at 50 G and 2 min 
at 72 G; followed by 5-10 cycles of 1 min at 94 G, 2 min at 54 G and 1 min at 72 G (step 1). 

[0502] The parental template DNA and the linear, mutagene^^jiMlfcer incorporating 
newly synthesized DNA are treated with Dpni (10 U) and Pfu DNA polymerase (2.51J). 
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This results in the Dpnl digestion of the in vivo methylated parental template and hybrid 
DNA and the removal, by Pfu DNA polymerase, of the Tag DNA polymerase-extended 
base(s) on the linear PGR product. 

[0503] The reaction is incubated at 37 C for 30 min and then transfeixed to 72 C for an 
additional 30 min (step 2). 

[0504] Mutagenesis buffer (Ix, 1 1 5 ul, contaiiaing 0.5 mM ATP) is added to the Dpnl- 
digested, Pfu DNA polymerase-polished PGR products, 

[0505] The solution is mixed and 10 ul is removed to a new microfage tube and T4 DNA 
ligase (2-4 U) added. 

[0506] The ligation is incubated for greater than 60 min at 37 C (step 3). 

[0507] The treated solution is transformed into competent E. coli (step 4). 

[0508] In addition to the PCT-based site-directed mutagenesis described above, other 
methods are available. Examples include those described in Kuxikel (1985) Proc. Natl 
Acad, ScL 82:488-492; Eckstein et aL (1985) Nucl Acids Res. 13:8764-8785; and using the 
GeneEditor™ Site-Directed Mutageneis Sytem from Promega. 



[0509] All patents and other references cited in the specification are indicative of the level 
of skill of those skilled in the art to which the invention pertains, and are incorporated by 
reference in their entireties, including any tables and figures, to the same extent as if each 
reference had been incorporated by reference in its entirety individually. 

[0510] One skilled in the art would readily appreciate that the present invention is well 
adapted to obtain the ends and advantages mentioned, as well as those inherent therein. The 
methods, variances, and compositions described herein as presently representative of 
preferred embodiments are exemplary and are not intended as limitations on the scope of 
the invention. Changes therein and other uses will occur to those skilled in the art, which 
are encompassed within the spirit of the invention, are defined by the scope of the claims. 

[0511] It will be readily apparent to one skilled in the art that varying substitutions and 
modifications may be made to the invention disclosed herein without departing from the 
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scope and spirit of the invention. For example, variations can be made to ciystallization or 
co-crystallization conditions for PIM proteins. Thus, such additional embodiments are 
within the scope of the present invention and the following claims. 

[0512] The invention illustratively described herein suitably may be practiced in the 
absence of any element or elements, limitation or limitations which is not specifically 
disclosed herein. Thus, for example, in each instance herein any of the terms "comprising", 
"consisting essentially of and "consisting of* may be replaced with either of the other two 
terms. The terms and expressions which have been employed are used as terms of 
description and not of limitation, and there is no intention that in the use of such terms and 
expressions of excluding any equivalents of the features shown and described or portions 
thereof, but it is recognized that various modifications are possible within the scope of the 
invention claimed. Thus, it should be imderstood that although the present invention has 
been specifically disclosed by preferred embodiments and optional features, modification 
and variation of the concepts herein disclosed may be resorted to by those skilled in the art, 
and that such modifications and variations are considered to be within the scope of this 
invention as defined by the appended claims. 

[051 3 J In addition, where features or aspects of the invention are described in terms of 
Markush groups or other grouping of alternatives, those skilled in the art will recognize that 
the invention is also thereby described in terms of any individual member or subgroup of 
members of the Markush group or other group, 

[0514] Also, unless indicated to the contrary, where various numerical values are provided 
for embodiments, additional embodiments are described by taking any 2 different values as 
the endpoints of a raxige. Such ranges are also within the scope of the described invention. 

[051 5) Thus, additional embodiments are within the scope of the invention and within the 
following claims. 
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REFINEMENT - 
PROGRAM 
AUTHORS 



REFMAC 5.1.19 
MURSHUDOV,VAGIN, DODSON 



REFINEMENT TARGET : MAXIMUM LIKELIHOOD 



DATA USED IN REFINEMENT. 
RESOLUTION RANGE HIGH (ANGSTROMS) 
RESOLUTION RANGE LOW (ANGSTROMS) 
DATA CUTOFF (SIGMA{F)) 
COMPLETENESS FOR RANGE (%) 
NUMBER OF REFLECTIONS 

FIT TO DATA USED IN REFINEMENT. 
CROSS-VALIDATION METHOD 
FREE R VALUE TEST SET SELECTION 
R VALUE (WORKING + TEST SET) 

R VALUE (WORKING SET) 

FREE R VALUE 

FREE R VALUE TEST SET SIZE' (%) 
FREE R VALUE TEST SET COUNT 

FIT IN THE HIGHEST RESOLUTION BIN, 
TOTAL NUMBER OF BINS USED 
BIN RESOLUTION RANGE 
BIN RESOLUTION RANGE 
REFLECTION IN BIN 
BIN R VALUE 
BIN FREE R VALUE SET 
BIN FREE R VALUE 



. 00 
.52 



2, 
84 . 
NONE 
99.27 

28693 



THROUGHOUT 

RANDOM 

0.22119 

0.22012 
0.24194 
5.0 
1498 



HIGH 
LOW 

(WORKING SET) 
(WORKING SET) 
COUNT 



20 
2.000 
2.052 
2096 
0.344 

102 
0.359 



NUMBER OF NON-HYDROGEN ATOMS USED IN REFINEMENT. 
ALL ATOMS : 2382 



B VALUES. 
FROM WILSON PLOT (A**2) 
MEAN B VALUE (OVERALL, A**2) 

OVERALL ANISOTROPIC B VALUE. 



NULL 
49.236 



Bll 
B22 
B33 
B12 
B13 
B23 



(A^*2) 
(A**2) 
(A**2) 
(A**2) 
(A**2> 
(A**2) 



32 
32 
99 
66 
00 
00 



ERROR. 



ESTIMATED OVERALL COORDINATE 
ESU BASED ON R VALUE 
ESU BASED ON FREE R VALUE 
ESU BASED ON MAXIMUM LIKELIHOOD 
ESU FOR B VALUES BASED ON MAXIMUM LIKELIHOOD 

CORRELATION COEFFICIENTS, 

CORRELATION COEFFICIENT FO-FC : 0.954 

CORRELATION COEFFICIENT FO-FC FREE : 0.947 



(A) 
(A) 
(A) 
(A**2> 



0.158 
0.142 
0.127 
4.758 



RMS DEVIATIONS FROM IDEAL VALUES 


COUNT 
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WEIGHT 


BOND LENGTHS REFINED ATOMS 


(A) 


: 2296 




0 


.011 




0.021 


BOND ANGLES REFINED ATOMS 


(DEGREES) 


: 3114 


/ 


1 


.088 


/ 


1.945 


TORSION ANGLES, PERIOD 1 


(DEGREES) 


: 273 


! 


3 


.838 


} 


5.000 


CHIRAL-CENTER RESTRAINTS 


(A**3) 


: 332 


! 


0 


.081 


r 


0.200 


GENERAL PLANES REFINED ATOMS (A) ; 


1784 


t 


0 


.004 


r 


0.020 


NON-BONDED CONTACTS REFINED 


ATOMS (A) ' 


1094 




0 


.215 


r 


0.200 
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SYMMETRY VDW REFINED ATOMS 
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REFINED ATOMS 


(A* 
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97.604 2.623 1.00 75.49 C 

97.043 2.567 1.00 75.51 O 

97.747 3.756 1.00 73.43 N 

97.129 5.004 1.00 71.44 C 

95,931 5.373 1.00 70.49 C 

94.919 4.255 1.00 67.67 C 

93.929 4.067 1.00 65.14 C 

93.019 3.032 1,00 63.31 C 

93.087 2.177 1.00 62.94 C 

92.191 1.151 1.00 64.24 O 

94.054 2.339 1.00 63.82 C 

94.967 3.371 1.00 65.13 C 

98.099 6.169 1.00 71.00 C 

98.914 6.385 1.00 70.68 O 

97.986 6.913 1.00 70.43 N 

98.756 8.124 1,00 69.86 C 

99.021 8.279 1.00 70.46 C 

100.007 9.385 1.00 73.80 C 

99.319 10.635 1.00 77.97 C 

98.794 10.625 1.00 79.52 O 
99.330 11.717 1.00 78.94 N 
97.967 9.322 1.00 68.49 C 
97.003 9.777 1.00 68.58 O 
98.390 9.834 1.00 66.87 N 
97.688 10.940 1.00 65.53 C 
98.137 11.138 1.00 65.20 C 
97.201 12.100 1.00 64.63 C 
98.174 9-804 1.00 65.16 C 
97.837 12.255 1.00 64.96 C 
98.893 12.547 1.00 65.27 O 
96.766 13.042 1.00 64.02 N 
96.764 14.310 1.00 62.22 C 
96.341 15.451 1.00 61.31 C 
96.572 15.406 1.00 60.92 O 
95.725 16.478 1.00 60.62 N 
95.313 17.666 1,00 60.91 C 
94.755 18.602 1.00 60.81 C 
95.332 18.089 1.00 60.97 C 
95.401 16.599 1.00 60.64 C 
94.235 17.385 1.00 60,96 C 
93.478 16.424 1.00 60.93 O 
94.180 18.226 1.00 61.11 N 
93.084 18.200 1.00 61.79 C 
93.411 19.110 1.00 61.59 C 
92.511 19.107 1.00 62.19 C 
92.704 17.848 1.00 61.21 C 
92.782 20.344 1.00 62.67 C 

91.795 18.643 1.00 62.48 C 
91.824 19.511 1.00 62.19 O 
90.671 18.030 1.00 63.26 N 
89.378 18.387 1.00 63.89 C 
88.663 17.154 1.00 63.70 C 
89.362 16.379 1.00 63.05 C 
88.562 15.131 1.00 62.30 C 
89.539 17.241 1.00 62.27 C 
88.474 19.110 1.00 64.82 C 
87.669 19.951 1.00 65.08 O 
88.602 18.784 1.00 66.08 N 
87.734 19.357 1.00 68.09 C 
88.122 19.027 1.00 69.52 C 
88.937 18.124 1.00 69.05 O 
87.517 19.774 1.00 71.08 N 
87.764 19.662 1.00 72.58 C 
88.922 20.588 1.00 72.92 C 
89.327 20.364 1.00 74.04 O 
86.513 20.027 1.00 73.06 C 
85.720 20.875 1.00 73.26 O 
86.349 19.394 1.00 73.67 N 
85.234 19.707 1.00 74.04 C 
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84.965 18.608 1.00 74.12 C 

85.889 17.883 1.00 74.48 O 

83.698 18,490 1.00 73.84 N 

83.257 17.478 1.00 73.16 C 

83.392 16.048 1.00 72.45 C 

83.374 15.093 1.00 72.37 O 

83.531 15.911 1.00 71.52 N 

83.755 14.612 1.00 70.43 C 

83.187 14.590 1.00 70.83 C 

83.752 15.661 1.00 73.19 C 

84.933 15.439 1.00 74.32 C 

85.453 16.419 1.00 75.71 C 

84.787 17.638 1.00 75.96 C 

83.605 17.874 1.00 75.42 C 

83.090 16.883 1.00 74.82 C 
85.231 14.157 1.00 68.55- C 
85.509 12.963 1.00 69.15 0 
86.164 15.106 1.00 66.20 N 
87.588 14.795 1.00 62.55 C 
88.351 15.578 1.00 59-75 C 
88.056 16.754 1.00 59.86 O 
89.332 14.916 1.00 56.20 N 
90.148 15.480 1.00 51.75 C 
91.614 15-587 1.00 51.18 C 
91.750 16.401 1.00 48.30 O 
90.076 14.537 1.00 49.79 C 
90.341 13.344 1.00 48.22 0 
89.724 15.064 1.00 47.77 N 
89.551 14.243 1.00 46.96 C 

88.091 14.312 1.00 46.48 C 
87.924 13.397 1.00 45.18 C 
87.082 13.955 1.00 45.83 C 
90.515 14.701 1.00 47.44 C 
90.493 15.869 1.00 48.09 O 
91.346 13.779 1.00 47.68 N 
92.315 14.055 1.00 48.21 C 
93.711 13.532 1.00 47.13 C 
94.335 14.223 1.00 46.28 C 
93.949 13.897 1.00 43.98 C 
94.495 14.532 1.00 42.59 C 
95.475 15.521 1.00 43.72 C 
96.012 16.159 1.00 43.55 O 
95.884 15.864 1.00 44.06 C 
95.305 15.221 1.00 45.68 C 
91.886 13.363 1.00 49.25 C 
91,335 12.258 1.00 48.07 O 
92.162 14.008 1.00 50.84 N 
91.959 13.398 1.00 53.09 C 

92.092 14.445 1.00 53.24 C 
91.939 13.854 1.00 55.73 0 
93.019 12.342 1.00 54.41 C 
94.129 12.510 1.00 55.46 0 
92.691 11.248 1.00 56.17 N 
93.625 10.143 1.00 58.09 C 
93.391 9.327 1.00 60.22 C 
92.343 9.433 1,00 60.35 0 
94.381 8.513 1.00 62.12 N 
94.305 7.635 1.00 64.34 C 
95.188 8.169 1.00 64.42 C 
94.639 9.500 1.00 65.47 C 
95.670 10.281 1.00 66.61 C 
95.273 7.149 1.00 65.46 C 
94.780 6.267 1.00 65.56 C 
95.805 6.155 1.00 65.96 0 
94.017 5.233 ^^,4^^67.26 N 
94.386 3.858 *^^toS&8. 96 C 
93.161 2.94'6 1.00 68.74 C 
93.361 1.520 1.00 68.49 C 
92.424 0.532 1.00 68.56 C 
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85.495 10.666 1.00 44.83 C 

84.780 11.488 1.00 46.29 C 
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89.833 9-867 1.00 41.57 C 

89.773 8.671 1.00 40.97 O 
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91.087 10.553 1.00 43.17 C 

92.419 11.318 1.00 42.46 C 

93.440 10.858 1.00 40.77 C 

93.384 11.203 1.00 43.58 N 

94.393 10.640 1.00 41.67 C 

95.100 9.942 1.00 41.02 N 

94.522 10.062 1.00 37.93 C 

90.217 10.960 1.00 45.01 C 

89.744 12.089 1.00 44.97 O 

90.027 10.030 1.00 46.61 N 

89.225 10.272 1.00 49.40 C 

87.969 9.359 1.00 49.48 C 

87.093 9.675 1.00 50.89 C 

87.146 9.529 1.00 49.70 C 

90.103 10.038 1.00 51.36 C 

90.721 8.983 1.00 50.89 O 

90.162 11.026 1.00 53.97 N 

90.924 10.896 1.00 57.27 C 

91.219 12.272 1.00 57.98 C 

92.205 13.105 1.00 61.81 C 

93.240 13.839 1.00 66.09 C 

93.928 14.744 1.00 67.00 O 
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92.635 3.848 1.00 65.13 C 
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89.589 8.987 1.00 64.37 C 

88.462 8.737 1.00 64.24 O 
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89.928 10.820 1.00 70.03 C 
91.037 11.814 1.00 70.83 C 
92.162 11.177 1.00 73.07 C 
92.764 10.174 1.00 75.44 O 
92.511 11.611 1.00 74.89 O 
88.614 11.551 1.00 71.40 C 
87.895 11.851 1.00 71.59 O 
88.287 11.805 1.00 73.23 N 
87.068 12.540 1.00 74.86 C 
87.380 13.602 1.00 75.53 C 
88.293 14-718 1.00 79-03 C 
88.151 16.043 1.00 84.58 C 
86.752 16.394 1.00 88.34 N 
85.901 16.878 1.00 89.85 C 
86.293 17.083 1.00 90.48 M 
84.650 17.161 1.00 90.61 N 

85.929 11.630 1.00 75.02 C 
85.067 12.054 1.00 75.13 O 
85-926 10.386 1.00 75.26 N 
84.885 9.420 1.00 75.36 C 
85.503 8.127 1.00 75.17 C 
86.425 8.465 1.00 74.84 C 
87.204 7.280 1.00 75.37 C 
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91.672 1.425 1.00 49.52 N 

92.912 1.077 1.00 50.21 C 

94.130 1.649 1.00 49.95 C 

94.259 1.182 1.00 51.89 C 

95.026 0.061 1.00 53.85 C 

95.139 -0.376 1.00 54.01 C 

94.477 0.310 1.00 54.23 ' C 

93.710 1.426 1.00 53.04 C 

93.611 1.855 1.00 52.18 C 

92,848 1.657 1.00 50.39 C 

92.251 2.711 1.00 49.25 O 
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96.489 5.220 1.00 54,07 C 

95.812 4.914 1.00 54.49 O 
96.342 6.382 1.00 54.16 N 
95.378 7.346 1.00 54.11 C 
96.112 8.575 1.00 55.71 C 
96.989 8.245 1.00 58.46 C 
98.057 7.607 1.00 61.78 O 
96.681 8.573 1.00 60.03 O 
94.269 7.755 1.00 52.78 C 
93.548 8.727 1.00 53.03 O 
94.111 6.999 1.00 50.98 N 
93.105 7.331 1.00 48.41 C 
93.649 8.439 1,00 48.04 C 
94.528 7.893 1.00 45.84 O 
92.666 6.155 1,00 47.32 C 
93.285 5.096 1.00 46.92 O 
91.609 6.381 1.00 45.66 N 
91.174 5.446 1.00 44.29 C 
89.746 4.994 1.00 44.76 C 
89.632 3.980 1,00 45.24 C 
89.484 4.385 1,00 45.34 C 
89.360 3.447 1.00 47.24 C 
89.387 2.077 1.00 46.18 C 
89.531 1.661 1.00 48.08 C 
89.646 2.615 1.00 47.94 C 
91,238 6.141 1.00 43.43 C 
90.945 7.333 1.00 42.72 O 
91.641 5.411 1.00 42.24 N 
91.681 5.945 1,00 41.74 C 
93.087 5.818 1.00 41.89 C 

93.136 6.447 1.00 41.40 C 
94.122 6.451 1-00 42.52 C 
90.652 5.209 1.00 41.80 C 
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ATOM 


659 


O 


VAL A 117 


13,507 


90.660 


3. 974 


1 . 00 40.77 


\J 


ATOM 


660 


N 


LEU A 118 


12.934 


89.765 


5.974 


1.00 41.04 


N 


ATOM 


661 


CA 


LEU A 118 


12.094 


88.693 


5. 410 


1.00 40.22 




ATOM 


662 


CB 


LEU A 118 


12.520 


87.337 


5.977 


1.00 39.98 


Q 


ATOM 


663 


CG 


LEU A 118 


13.795 


86.695 


5.423 


1.00 40.10 




ATOM 


664 


GDI 


LEU A 118 


15.014 


87 .562 


5. 619 


1.00 42.56 




ATOM 


665 


CD2 


LEU A 118 


14.032 


85.325 


6.0 63 


1.00 39.40 




ATOM 


666 


C 


LEU A 118 


10. 635 


88.939 


5.720 


1.00 40.25 


Q 


ATOM 


667 


O 


LEU A 118 


10.275 


89.223 


6.861 


1,00 39.35 


0 


ATOM 


668 


N 


ILE A 119 


9.795 


88 . 842 


4.700 


1.00 39.45 


N 


ATOM 


669 


CA 


ILE A 119 


8.370 


89.028 


4.876 


1.00 40.37 


0 


ATOM 


67 0 


CB 


ILE A 119 


7. 774 


89.921 


3.756 


1.00 40.19 


Q 


ATOM 


671 


CGI 


ILE A 119 


8.555 


91.248 


3. 612 


1.00 41.49 


Q 


ATOM 


672 


CDl 


ILE A 119 


8.540 


92.131 


4 . 855 


1.00 40.16 


Q 


ATOM 


673 


CG2 


ILE A 119 


6.296 


90.136 


3.989 


1.00 39.40 


c 


ATOM 


674 


C 


ILE A 119 


7.748 


87.638 


4 .823 


1.00 41.09 




ATOM 


675 


O 


ILE A 119 


7.793 


86. 966 


3.788 


1.00 40.59 


0 


ATOM 


676 


N 


LEU A 120 


7.167 


87.222 


5.939 


1.00 41.43 


N 


ATOM 


677 


CA 


LEU A 120 


6.634 


85.872 


6.07 6 


1.00 42.78 


c 


ATOM 


678 


CB 


LEU A 120 


7.355 


85.144 


7.216 


1,00 41,61 


c 


ATOM 


679 


CG 


LEU A 120 


8.868 


85.010 


7 .046 


1.00 40.99 


c 


ATOM 


680 


CDl 


LEU A 120 


9.558 


84.928 


8.402 


l.OO 43.47 


c 


ATOM 


681 


CD2 


LEU A 120 


9.234 


83.785 


6.187 


1.00 42.48 


c 


ATOM 


682 


C 


LEU A 120 


5,138 


85.922 


6.330 


1.00 44.08 


c 


ATOM 


683 


O 


LEU A 120 


4 , 604 


86.963 


6.715 


1.00 44.77 


o 


ATOM 


684 


N 


GLU A 121 


4.449 


84.808 


6.109 


l.OO 45.28 


N 


ATOM 


685 


CA 


GLU A 121 


3.026 


84.738 


6. 436 


1.00 47.09 


c 


ATOM 


686 


CB 


GLU A 121 


2. 430 


83. 409 


5. 985 


1.00 47.99 


c 


ATOM 


687 


CG 


GLU A 121 


2.534 


83.123 


4.497 


1.00 49.97 


c 


ATOM 


688 


CD 


GLU A 121 


1.959 


81.759 


4.170 


1.00 53.32 


c 


ATOM 


689 


OEl 


GLU A 121 


0.911 


81.714 


3.506 


1-00 55.61 


o 


ATOM 


690 


OE2 


GLU A 121 


2.548 


80.735 


4.586 


1.00 52.97 


o 


ATOM 


691 


C 


GLU A 121 


2.841 


84.862 


7.938 


1.00 47.78 


c 


ATOM 


692 


O 


GLU A 121 


3.753 


84.550 


8.711 


1.00 47.35 


o 


ATOM 


693 


N 


ARG A 122 


1.670 


85.326 


8.351 


1,00 48.92 




ATOM 


694 


CA 


ARG A 122 


1.369 


85.439 


9.766 


1.00 50.98 


C 


ATOM 


695 


CB 


ARG A 122 


1.555 


86.883 


10.257 


1.00 50.78 


C 


ATOM 


696 


CG 


ARG A 122 


1.196 


87.085 


11,730 


1.00 51.57 


c 


ATOM 


697 


CD 


ARG A 122 


1.716 


88.383 


12.349 


1.00 51.63 


C 


ATOM 


698 


NE 


ARG A 122 


1.119 


89.578 


11.744 


1.00 52.11 


N 


ATOM 


699 


CZ 


ARG A 122 


-0.133 


89. 977 


11.951 


1,00 51.61 


C 


ATOM 


700 


NHl 


ARG A 122 


-0.937 


89.274 


12.741 


1.00 51.42 


N 


ATOM 


701 


NH2 


ARG A 122 


-0.588 


91.070 


11.354 


1.00 50.64 


N 


ATOM 


702 


C 


ARG A 122 


-0.054 


84.965 


10.017 


1.00 52.37 


C 


ATOM 


703 


O 


ARG A 122 


-1.005 


85.749 


9.922 


1.00 52.94 


O 


ATOM 


704 


N 


PRO A 123 


-0.211 


83. 682 


10.328 


1.00 53.57 


N 


ATOM 


705 


CA 


PRO A 123 


-1.529 


83.141 


10.672 


1.00 54.00 


C 


ATOM 


706 


CB 


PRO A 123 


-1.227 


81.669 


10.973 


1.00 54.39 


c 


ATOM 


707 


CG 


PRO A 123 


0.057 


81.394 


10.250 


1.00 54.18 


c 


ATOM 


708 


CD 


PRO A 123 


0.845 


82.652 


10.398 


1.00 53.87 


C 


ATOM 


709 


C 


PRO A 123 


-2.012 


83.835 


11.928 


1.00 54.39 


C 


ATOM 


710 


O 


PRO A 123 


-1.172 


84.333 


12.676 


1-00 54.50 


o 


ATOM 


711 


N 


GLU A 124 


-3.322 


83.862 


12.164 


1.00 54.85 


N 


ATOM 


712 


CA 


GLU A 124 


-3,859 


84.533 


13.348 


1.00 55, 63 


C 


ATOM 


713 


CB 


GLU A 124 


-3.870 


86.046 


13,089 


1.00 56.97 


C 


ATOM 


714 


CG 


GLU A 124 


^3.856 


86. 946 


14.335 


1.00 62.93 


C 


ATOM 


715 


CD 


GLU A 124 


-4.020 


88.417 


13.933 


1.00 69.90 


C 


ATOM 


716 


OEl 


GLU A 124 


-4.962 


88.742 


13.153 


1.00 72.13 


0 


ATOM 


717 


OE2 


GLU A 124 


-3.195 


89.256 


14.385 


1.00 71.98 


o 


ATOM 


718 


C 


GLU A 124 


-5.270 


84.018 


13.671 


1.00 53,97 


c 


ATOM 


719 


O 


GLU A 124 


-6.093 


83.910 


12.764 


1.00 54.52 


0 


ATOM 


720 


N 


PRO A 125 


-5.563 


83. 673 


14,930 


1.00 52.32 


N 


ATOM 


721 


CA 


PRO A 125 


-4.601 


83.673 


16.040 


1.00 51.14 


C 


ATOM 


722 


CB 


PRO A 125 


-5.504 


83.553 


17.275 


1.00 51.19 


C 


ATOM 


723 


CG 


PRO A 125 


-6.689 


82.766 


16.783 


1.00 50.94 


C 


ATOM 


724 


CD 


PRO A 125 


-6. 906 


83.255 


15.374 


1.00 51.87 


C 


ATOM 


725 


C 


PRO A 125 


-3.694 


82.453 


15. 970 


1.00 50.27 


C 


ATOM 


726 


O 


PRO A 125 


-4.057 


81.436 


15.379 


1.00 49.98 


o 


ATOM 


727 


N 


VAL A 126 


-2.526 


82.562 


16.588 


1.00 49.17 


N 
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ATOM 


728 


CA 


VAL 


A 


126 


-1. 


525 


ATOM 


729 


CB 


VAL 


A 


126 


-0, 


,508 


ATOM 


730 


CGI 


VAL 


A 


126 


0. 


307 


ATOM 


731 


CG2 


VAL 


A 


126 


0. 


400 


ATOM 


732 


C 


VAL 


A 


126 


-0. 


,848 


ATOM 


733 


O 


VAL 


A 


126 


-0. 


,787 


ATOM 


734 


N 


GLN 


A 


127 


-0. 


.360 


ATOM 


735 


CA 


GLN 


A 


127 


0. 


,495 


ATOM 


736 


CB 


GLN 


A 


127 


-0. 


,356 


ATOM 


737 


CG 


GLN 


A 


127 


0. 


,414 


ATOM 


738 


CD 


GLN 


A 


127 


-0, 


.498 


ATOM 


739 


OEl 


GLN 


A 


127 


-1, 


.346 


ATOM 


740 


NE2 


GLN 


A 


127 


-0. 


.336 


ATOM 


741 


C 


GLN 


A 


127 


1, 


.500 


ATOM 


742 


O 


GLN 


A 


127 


1, 


.136 


ATOM 


743 


N 


ASP 


A 


128 


2, 


.755 


ATOM 


744 


CA 


ASP 


A 


128 


3, 


.719 


ATOM 


745 


CB 


ASP 


A 


128 


5, 


.174 


ATOM 


746 


CG 


ASP 


A 


128 


5.670 


ATOM 


747 


ODl 


ASP 


A 


128 


5 


.553 


ATOM 


748 


OD2 


ASP 


A 


128 


6 


.231 


ATOM 


749 


C 


ASP 


A 


128 


3 


.482 


ATOM 


750 


O 


ASP 


A 


128 


2. 


.898 


ATOM 


751 


N 


LEU 


A 


129 


3 


.908 


ATOM 


752 


CA 


LEU 


A 


129 


3 


.665 


ATOM 


753 


CB 


LEU 


A 


129 


4 


.146 


ATOU 


754 


CG 


LEU 


A 


129 


3 


.950 


ATG>. 


755 


GDI 


LEU 


A 


129 


2 


.485 


ATOn 


756 


CD2 


LEU 


A 


129 


4 


.490 


ATCVI 


757 


C 


LEU 


A 


129 


4 


.281 


ATO,'1 


7^8 


o 


LEU 


A 


129 


3 


.730 


ATOM 


7 59 


N 


PHE 


A 


130 


5 


.422 


ATOM 


?6C 


CA 


PHE 


A 


130 


6 


.047 


ATOM 


?fc.U 


CB 


PHE 


A 


130 


7 


.342 


ATOM 


762 


CG 


PHE 


A 


130 


8 


.070 


ATOM 


763 


GDI 


PHE 


A 


130 


7 


.678 


ATOM 


764 


CEl 


PHE 


A 


130 


8 


.349 


ATOM 


7 65 


CZ 


PHE 


A 


130 


9 


.404 


ATOM 


766 


C.E2 


PHE 


A 


130 


9 


.806 


ATOM 


7 67 


CD2 


PHE 


A 


130 


9 


.132 


ATOM 


768 


C 


PHE 


A 


130 


5 


.099 


ATOM 


769 


o 


PHE 


A 


130 


4 


.849 


ATOM 


770 


N 


ASP 


A 


131 


4 


.596 


ATOM 


771 


CA 


ASP 


A 


131 


3 


.719 


ATOM 


772 


CB 


ASP 


A 


131 


3 


.418 


ATOM 


773 


CG 


ASP 


A 


131 


4 


. 620 


ATOM 


774 


ODl 


ASP 


A 


131 


5 


.574 


ATOM 


775 


OD2 


ASP 


A 


131 


4 


.702 


ATOM 


776 


C 


ASP 


A 


131 


2 


.433 


ATOM 


777 


O 


ASP 


A 


131 


1 


.888 


ATOM 


778 


N 


PHE 


A 


132 


1 


.972 


ATOM 


779 


CA 


PHE 


A 


132 


0 


.760 


ATOM 


780 


CB 


PHE 


A 


132 


0 


.459 


ATOM 


781 


CG 


PHE 


A 


132 


-0 


.854 


ATOM 


782 


GDI 


PHE 


A 


132 


-2 


. 039 


ATOM 


783 


CEl 


PHE 


A 


132 


-3 


.251 


ATOM 


784 


CZ 


PHE 


A 


132 


-3 


.297 


ATOM 


785 


CE2 


PHE 


A 


132 


-2 


.123 


ATOM 


786 


CD2 


PHE 


A 


132 


-0 


. 910 


ATOM 


787 


C 


PHE 


A 


132 


0 


. 902 


ATOM 


788 


O 


PHE 


A 


132 


-0 


.006 


ATOM 


789 


N 


ILE 


A 


133 


2 


.040 


ATOM 


790 


CA 


ILE 


A 


133 


2 


.286 


ATOM 


791 


CB 


ILE 


A 


133 


3 


.487 


ATOM 


792 


CGI 


ILE 


A 


133 


3 


.071 


ATOM 


793 


CDl 


ILE 


A 


133 


4 


.220 


ATOM 


794 


CG2 


ILE 


A 


133 


4 


.026 


ATOM 


795 


C 


ILE 


A 


133 


2 


.514 


ATOM 


796 


o 


ILE 


A 


133 


2 


.046 



81.526 16.500 1.00 48-07 c 

81.866 15.373 1.00 48.55 c 

83.104 15.726 1.00 49.90 c 

80.711 15.096 1.00 50.31 C 

81.346 17.855 1.00 46.31 C 

82.290 18.644 1.00 46.40 O 

80.129 18.117 1.00 43.89 N 

79.825 19.270 1.00 41.91 C 

79.311 20.438 1.00 41.96 C 

79.085 21.751 1.00 41.25 C 

78.618 22.880 1.00 41.99 . C 

77.744 22.688 1.00 41.61 O 

79.214 24.052 1.00 40.70 N 

78.750 18.868 1.00 41.00 C 

77.807 18.153 1.00 40.30 O 

78.874 19.307 1.00 40.28 N 

77.829 18.995 1.00 39.77 C 

78.319 19.018 1.00 40.70 C 

78.691 20.390 1.00 42.88 C 

77.903 21.369 1.00 46.75 O 

79.788 20.562 1.00 48.33 O 

76.609 19-881 1.00 39.44 C 

76.723 20.978 1,00 38.51 O 

75.443 19.392 1.00 37.35 N 

74.196 20.084 1.00 35.60 C 

73.015 19.224 1.00 33.96 C 

71.607 19.773 1.00 34.42 C 

71.352 20.108 1.00 29.98 C 

70.573 18.768 1.00 33.34 C 

74.165 21.489 1.00 35.69 C 

73,565 22.403 1.00 34.94 O 

74.804 21.659 1.00 36.83 N 

74.850 22.976 1.00 38.88 C 
75.665 22.930 1.00 39.30 C 
75.714 24.254 1.00 42.49 C 
76.621 25.251 1.00 45.52 C 
76.680 26.489 1.00 46.12 C 
75.807 26.747 1.00 46.89 C 
74.886 25.758 1.00 47-58 C 

74.851 24.514 1.00 44.28 C 
75.492 23.989 1.00 39.51 C 
74.931 25.064 1.00 38.80 O 
76.679 23.657 1.00 40.69 N 
77.401 24.583 1.00 42.23 C 

78.805 24.088 1.00 43.13 C 

79.699 24.113 1.00 44.46 C 
79.431 24.874 1.00 47.95 O 

80.700 23.375 1.00 49.81 O 
76.642 24.771 1.00 42.09 C 
76.600 25.877 1.00 42-20 O 
76.001 23-696 1-00 41.67 N 
75.195 23.744 1-00 41.50 C 
74.630 22.358 1.00 41-82 C 
73.909 22.263 1.00 40.63 C 
74.618 22.148 1.00 40.87 C 
73.965 22.053 1.00 41.37 C 
72.581 22.062 1.00 41.96. C 
71.855 22.173 1.00 40.31 C 
72.524 22.281 1.00 42.21 C 
74.058 24.760 1.00 42.73 C 
73.809 25.570 1.00 42.57 O 
73.369 24.718 1.00 42.81 N 
72.247 25.630 1.00 43.91' C 
71.395 25.138 1.00 42.90 C 
70-570 23.920 1-00 41.19 C 
69.961 23.174 1-00 41.52 C 
70.461 26.249 1.00 42.29 C 
72.780 27.054 1.00 46.10 C 
72.191 28.023 1.00 46.11 O 
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t\l VJIM 


797 


N 


THR 


A 


134 


3 


. 231 


73 


.894 


27.162 


1.00 


48 


. 42 


N 




798 


CA 


THR 


A 


134 


3 


.487 


74 


.529 


28.453 


1.00 


51 


.23 


C 


Ai UJXl 


799 


CB 


THR 


A 


134 


4 


.314 


75 


.805 


28.261 


1.00 


51 


.04 


C 


Ai UM 


800 


OGl 


THR 


A 


134 


5 


.695 


75 


.440 


28.115 


1.00 


52 


.55 


O 




801 


CG2 


THR 


A 


134 


4 


.293 


76 


. 695 


29.524 


1.00 


53 


.00 


C 


TV rn /^TiyT 


802 


C 


THR 


A 


134 


2 


.179 


74 


.851 


29.159 


1.00 


52 


.34 


C 




803 


O 


THR 


A 


134 


2 


.069 


74 


.673 


30.365 


1.00 


53 


.35 


o 




804 




GLU 


A 


135 


1 


.188 


75 


.303 


28.400 


1.00 


53 


.42 


N 


Al UM 


805 


CA 


GLU 


A 


135 


-0.110 


75 


. 662 


28.959 


1.00 


54 


.43 


C 


TV m/^i\yr 

ATOM 


806 


CB 


GLU 


A 


135 


-0 


.840 


76 


.644 


28.038 


1.00 


55 


.03 


C 


ATOM 


807 


CG 


GLU 


A 


135 


-0 


.137 


78 


.004 


27.941 


1.00 


59 


.01 


c 


ATOM 


808 


CD 


GLU 


A 


135 


-0 


.981 


79 


.054 


27.234 


1.00 


62 


.58 


c 


ATOM 


809 


OEl 


GLU 


A 


135 


-1 


.942 


78 


. 685 


26.505 


1.00 


64 


.20 


o 


ATOM 


810 


OE2 


GLU 


A 


135 


-0 


. 675 


80 


.254 


27.412 


1.00 


63 


. 63 


o 


ATOM 


811 


C 


GLU 


A 


135 


-1 


.009 


74 


.468 


29.215 


1.00 


53 


. 96 


c 


ATOM 


812 


O 


GLU 


A 


135 


-1 


.743 


74 


.440 


30.206 


1.00 


54 


.61 


o 


ATOM 


813 


N 


ARG 


A 


136 


-0 


.975 


73 


.490 


28.318 


1.00 


52 


.83 


N 


ATOM 


814 


CA 


ARG 


A 


136 


-1 


.947 


72 


.404 


28.385 


1.00 


51 


. 61 


C 


ATOM 


815 


CB 


ARG 


A 


136 


-2 


. 646 


72 


.261 


27.036 


1.00 


52 


.42 


C 


ATOM 


816 


CG 


ARG 


A 


136 


-3 


.486 


73 


. 503 


26.736 


1.00 


55 


.25 


C 


ATOM 


817 


CD 


ARG 


A 


136 


-4 


.130 


73 


.538 


25.378 


1.00 


58 


.99 


c 


ATOM 


818 


NE 


ARG 


A 


136 


-4 


.990 


72 


.381 


25.145 


1.00 


60 


,79 


N 


ATOM 


819 


cz 


ARG 


A 


136 


-6 


.072 


72 


.415 


24-379 


1.00 


61 


.01 


C 


ATOM 


820 


NHl 


ARG 


A 


136 


-6 


.425 


73 


.559 


23.777 


1.00 


60 


.57 


N 


ATOM 


821 


NH2 


ARG 


A 


136 


-6 


.793 


71 


.310 


24.207 


1.00 


60 


.45 


N 
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74.131 15.396 1.00 44.41 C 
73.391 16.374 1.00 44.09 O 
75.349 15.606 1.00 47.46 O 

71.870 14.231 1.00 37.42 C 
72.783 14.722 1.00 38.05 O 
70.671 13.939 1.00 36.17 N 
70.380 14.201 1.00 34.82 C 
69.064 13.505 1.00 35.54 C 
69.196 11.976 1.00 34.25 C 
67.836 11.258 1.00 36.78 C 
68.692 13.951 1.00 34.38 C 
70.252 15.708 1.00 34.43 C 
69.409 16.350 1.00 32.59 O 
71.091 16.260 1.00 33.96 N 
71.056 17.701 1.00 33.91 C 
71.647 18.511 1.00 34.23 C 
73.064 18.140 1.00 35.79 C 
73,538 18.954 1.00 40.57 C 
75.024 18,631 1.00 44.31 C 
75.616 19.597 1.00 47.28 N 
71.860 17.947 1.00 33.20 C 
72.619 17.065 1.00 32.12 O 
71.731 19.138 1.00 33.18 N 
72.433 19.438 1.00 35.25 C 

72.132 20.839 1.00 36.00 C 
72.296 21.895 1.00 38.87 C 
72.820 21.626 1.00 41.81 O 
71.892 23,042 1.00 42.14 O 
73.934 19.226 1.00 35.41 C 

74.510 18.717 1.00 35.26 O 
74.550 19.570 1.00 36.73 N 
76.005 19.446 1.00 38.95 C 
76.462 20.053 1.00 40.09 C 
76.109 21.523 1.00 46.32 C 
76.436 22,017 1,00 53.18 C 
75.621 21.798 l.OO 56.05 O 

77.511 22.636 1.00 57.82 O 
76.438 17.983 1.00 38.68 C 
77.583 17.678 1.00 38.39 O 
75.531 17.080 1.00 37.88 N 
75.848 15.653 1.00 37.39 C 
75.288 15.019 1.00 37.23 C 
76.063 15.433 1.00 38.63 C 
77.241 15.769 1.00 38.90 O 
75.407 15.390 1.00 36.90 N 
75.353 14.917 1.00 36.59 C 
75.150 13.710 1.00 36,84 O 
75.141 15-653 1,00 35.61 N 
74.742 15.085 1.00 35.39 C 
73.292 15.516 1.00 35.63 C 
72.283 14.897 1.00 36.07 C 

70.871 15,388 1.00 33.90 C 
72.954 15.161 1.00 35.62 C 
75.691 15.548 1.00 36.51 C 
75.863 16.767 1,00 35.35 O 
76.286 14.588 1.00 36.38 N 
77.192 14.900 1.00 37.97 C 
78.506 14.107 1.00 38.37 C 
79.472 14.437 1.00 41.65 C 
80,715 13.531 1.00 44.42 C 
79.905 15.882 1.00 42.22 C 
76.518 14.642 1.00 38.02 C 
75.822 13.642 1.00 37.64 O 
76,721 15.567 1.00 37.87 N 

76.199 15.417 1.00 38.43 C 
75.563 16.744 1.00 38.70 C 
74.510 17.228 1.00 38.66 C 

74.200 18.722 1.00 41.58 C 
74.971 16.609 1.00 36.19 *C 
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1270 


0 


LEU 


A 


192 


21. 


187 


ATOM 


1271 


N 


LEU 


A 


193 


19. 


227 


ATOM 


1272 


CA 


LEU 


A 


193 


19. 


237 


ATOM 


1273 


CB 


LEU 


A 


193 


17. 


870 


ATOM 


1274 


CG 


LEU 


A 


193 


17. 


658 


ATOM 


1275 


CDl 


LEU 


A 


193 


17. 


805 


ATOM 


1276 


CD2 


LEU 


A 


193 


16. 


279 


ATOM 


1277 


C 


LEU 


A 


193 


20. 


306 


ATOM 


1278 


O 


LEU 


A 


193 


20. 


386 


ATOM 


1279 


N 


LYS 


A 


194 


21. 


110 



152 



73.726 9.494 1.00 38.41 c 

73.126 9.811 1.00 42.44 C 

72.736 11.300 1.00 44.98 C 

71.900 8.895 1.00 44.78 C 

76.195 10.135 1.00 37.17 c 

76.886 9.159 1.00 34.75 O 

76.590 11.085 1.00 37.08 N 

77.882 11.007 1.00 38.22 c 

78.967 11.964 1.00 38.24 c 

78.523 13.428 1.00 38.15 c 

79.649 14.460 1.00 40.09 c 

79.280 11.555 1.00 37.83 c 

77.766 11.365 1.00 38.83 c 

76.792 12.000 1.00 38.96 0 

78.807 10.980 1.00 39.40 N 

78.985 11.315 1.00 40.27 c 

79.073 12.830 1.00 41.48 c 

79.582 13.178 1.00 45.31 c 

79.992 12.255 1.00 47.25 O 

79.588 14.351 1.00 50.71 o 

78.013 10.648 1,00 40.90 c 

77.096 11.285 1.00 40.21 O 

78.292 9.378 1.00 40.98 N 

77.522 8.591 1.00 42.42 C 

77.517 7.140 1.00 41.27 C 
76.662 6.891 1.00 41.11 C 
77.128 7.202 1.00 40.65 C 
76.342 6.977 1.00 39.93 C 
75.078 6.444 1.00 40.45 ' C 
74.586 6.128 1.00 38.34 C 
75.373 6.349 1.00 40.39 C 
78.031 8.711 1.00 43.76 C 
77.647 7.927 1.00 44.80 O 
78.868 9.716 1.00 44.55 N 
79.449 9.940 1.00 45.36 C 
78.493 10.252 1.00 45.97 C 
78.870 10.168 1.00 46.90 O 
77.260 10.631 1.00 46.13 N 
76.240 10.866 1.00 45.91 C 
75-652 12.280 1.00 46.57 C • 
76.674 13.258 1.00 51.43 O 
75.111 9.825 1.00 44.92 C 
74.051 9.977 1.00 43.79 O 
75.337 8.784 1.00 44.05 N 
74.310 7.784 1.00 43.67 C 
74.120 6.783 1.00 43.35 C 
74.764 6.862 1.00 41.72 O 
73.222 5.825 1.00 42.84 N 
72.942 4.769 1.00 41.99 C 
72.067 5.294 1.00 41.84 C 
72.305 3.551 1.00 42.18 C 
71.813 3.608 1.00 41.24 O 
72.347 2.428 1,00 42.32 N 
71.649 1.236 1.00 42.24 C 
71.977 0.031 1-00 43.21 C 

73.455 -0.389 1.00 46.28 C 
73.763 -1.328 1.00 49.62 C 
73,889 -1.025 1.00 46.71 C 
70.179 1,589 1.00 41.12 Q 
69.742 2.270 1.00 39.72 O 
69.428 1.190 1.00 41.80 N 
67.983 1.401 1.00 43.29 c 
67.405 1.066 1.00 43.47 c 
65.896 1.213 1.00 45.93 c 

65.456 2.67: .1.00 45.54 c 

65.518 0.652 1.00 46.41 C 
67.331 0.512 1.00 43.97 c 
67.649 -0.667 1.00 44.14 o 
66.434 1.084 1.00 44.53 N 
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ATOM 
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45.29 


C 
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1, 
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.00 
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.144 


1, 
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1. 


.00 
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C 
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CE 


LYS 


A 


194 


25. 
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3, 
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.00 


46.87 


C 
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27. 


.243 


67. 
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3. 


.464 


1 


.00 


48.08 


N 
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A 
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134 
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0, 
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.00 
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O 
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A 
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,615 
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2, 


.026 
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O 
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N 
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A 
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63. 
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0. 
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N 
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CA 


ASP 
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0, 


.609 
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C 
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CB 
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C 
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CG 
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.00 
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-0. 
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.00 
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0 
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C 
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A 
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1, 


.384 
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.00 


45.41 


C 


ATOM 
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O 
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A 
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24, 
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.519 


1. 


.950 


1, 


.00 
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.00 
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.00 


46.75 


c 
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A 
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3. 
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.00 


46.70 


0 
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N 
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N 


ATOM 


1304 


CA 


VAL 


A 


197 


27. 
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.00 
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C 
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A 
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801 


62, 
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1. 


.00 


50.58 


C 
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1307 
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A 
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282 


62. 
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956 
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.00 


49. 66 


c 
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C 


VAL 


A 
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27. 


366 


64. 


409 


5. 
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.00 


48. 10 


c 
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1309 


O 
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A 
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27. 


949 


65. 
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5. 
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.00 
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.00 


47.69 
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412 
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951 
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43.05 
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23. 
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67. 
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12. 


664 


1. 


00 


60.22 


N 


ATOM 


1338 


CA 


PHE 


A 


201 


25. 


061 


71. 


089 


13. 


315 


1. 


00 


61.47 


C 


ATOM 


1339 


CB 


PHE 


A 


201 


24. 


847 


69. 


790 


14. 


094 


1. 


00 


61.39 


C 


ATOM 


1340 


CG 


PHE 


A 


201 


23. 


526 


69. 


717 


14 . 


805 


1. 


00 


61.76 


C 


ATOM 


1341 


CDl 


PHE 


A 


201 


22. 


342 


69. 


550 


14 . 


085 


1. 


00 


62.43 


c 


ATOM 


1342 


CEl 


PHE 


A 


201 


21. 


110 


69. 


475 


14. 


741 


1. 


00 


62.41 


c 


ATOM 


1343 


CZ 


PHE 


A 


201 


21. 


064 


69. 


560 


16. 


131 


1. 


00 


62.12 


c 


ATOM 


1344 


CE2 


PHE 


A 


201 


22. 


242 


69. 


727 


16. 


856 


1. 


00 


61.55 


c 


ATOM 


1345 


CD2 


PHE 


A 


201 


23. 


464 


69. 


804 


16. 


190 


1. 


00 


61.34 


c 


ATOM 


1346 


C 


PHE 


A 


201 


24. 


957 


72. 


286 


14. 


245 


1. 


00 


62.42 


c 


ATOM 


1347 


O 


PHE 


A 


201 


25. 


712 


72. 


411 


15. 


214 


1. 


00 


62.75 


o 


ATOM 


1348 


N 


ASP 


A 


202 


24. 


012 


73. 


158 


13. 


934 


1. 


00 


63.58 


N 



SUBSTITUTE SHEET (RULE 26) 



wo 2005/028624 



ATOM 


1349 


CA 


ASP A 202 


ATOM 


1350 


CB 


ASP A 202 


ATOM 


1351 


CG 


ASP A 202 


ATOM 


1352 


ODl 


ASP A 2 02 


ATOM 


1353 


0D2 


ASP A 202 


ATOM 


1354 


C 


ASP A 202 


ATOM 


1355 


0 


ASP A 202 


ATOM 


1356 


N 


GIjY a 203 


ATOM 


1357 


CA 


GLY A 203 


ATOM 


1358 


C 


GLY A 203 


ATOM 


1359 


O 


GLY A 203 


ATOM 


1360 


N 


THR A 204 


ATOM 
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CE2 
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ATOM 
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N 


SER A 208 


ATOM 


1398 


CA 


SER A 208 


ATOM 


1399 


CB 


SER A 208 


ATOM 
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OG 


SER A 208 


ATOM 


1401 


C 


SER A 208 


ATOM 


1402 


O 


SER A 208 


ATOM 


1403 


N 


PRO A 209 


ATOM 


1404 


CA 


PRO A 2 09 


ATOM 


1405 


CB 


PRO A 2 09 


ATOM 


1406 


CG 


PRO A 209 


ATOM 


1407 


CD 


PRO A 2 09 


ATOM 


1408 


C 


PRO A 2 09 


ATOM 


1409 


O 


PRO A 209 


ATOM 


1410 


N 


PRO A 210 


ATOM 


1411 


CA 


PRO A 210 


ATOM 


1412 


CB 


PRO A 210 


ATOM 


1413 


CG 


PRO A 210 


ATOM 


1414 


CD 


PRO A 210 


ATOM 


1415 


C 


PRO A 210 


ATOM 


1416 


O 


PRO A 210 


ATOM 


1417 


N 
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23.820 74.406 14.651 1.00 64.74 C 

24.100 75.583 13.704 1.00 65.58 C 

23.966 76.930 14.388 1.00 69.34 C 

24.626 77.141 15.440 1.00 71.91 O 

23.207 77.831 13.950 1.00 72.83 O 

22.397 74.467 15.198 1.00 64.11 C 

21.920 75.524 15.600 1.00 64.30 O 

21.716 73.324 15.202 1.00 63.47 N 

20.358 73.250 15.712 1.00 62.03 C 

20.346 72.947 17.200 1.00 60.94 C 

21.392 72.972 17.854 1.00 61.08 O 

19.158 72.643 17.727 1.00 59.86 N 

18.975 72.364 19.158 1.00 58.03 C 

17.481 72.402 19.547 1.00 57.90 . C 

16.900 73.630 19,090 1.00 56.77 O 

17.332 72.488 21.079 1.00 57-65 C 

19.574 71.032 19.575 1.00 57.57 C 

19.196 69,966 19,047 1.00 56.94 O 

20.487 71.106 20.545 1.00 56.60 N 

21.238 69.959 21.022 1.00 56.09 C 

22.204 70.417 22.124 1.00 56,67 C 

22.870 69.291 22.879 1.00 59.97 C 

24.127 69.719 23.631 1.00 63,64 C 

25.317 69.608 22.785 1.00 64.42 N 

26.049 68.501 22.667 1.00 65.48 C 

25.712 67.410 23.340 1.00 65.75 N 

27.114 68.476 21.872 1.00 64.31 N 

20.360 68.784 21.503 1.00 55.41 C 
20.536 67.630 21.069 1.00 55.27 O 
19.420 69.077 22.400 1.00 54.06 N 
18,634 68.037 23.067 1.00 52.40 C 
17.704 68,640 24.178 1.00 52.71 C 
18.516 69.018 25.416 1.00 50.99 C 
16.919 69.844 23.636 1.00 51.73 C 
17.799 67.291 22,048 1.00 51.99 C 
17.219 66.257 22.363 1.00 52.27 O 
17.731 67.834 20.830 1.00 50.50 N 
17.001 67.202 19.738 1.00 49.94 C 
16.126 68.236 19.021 1.00 49.34 C 
14.759 68.542 19.600 1.00 48.61 C 

14.604 69.438 20.679 1.00 49.28 C 
13.314 69.753 21.194 1.00 48.65 C 
12.182 69.164 20.590 1.00 50.59 C 
10.901 69,447 21.042 1.00 47.38 O 
12.332 68,284 19.488 1.00 48.14 C 

13.605 67.999 19.007 1.00 48.95 C 
17.982 66.571 18.718 1.00 49.22 C 
17.560 66.165 17.621 1.00 48.88 O 

19.269 66.529 19.085 1.00 48.13 N 

20.361 66.030 18.231 1.00 47.87 C 
21.667 66.791 18.496 1.00 48.10 C 
22.280 66.316 19.688 1.00 49.78 O 
20.620 64.566 18.503 1.00 46.42 C 
20.531 64.110 19.656 1.00 46.94 O 
20.941 63.826 17.449 1.00 44.93 N 
21.043 62.374 17.545 1.00 43.05 C 

20.979 61.948 16.083 1.00 43.11 C 
21.596 63.062 15.366 1.00 43.72 C 
21.165 64.293 16-070 1.00 45.04 C 
22,334 61.918 18.200 1,00 42.08 C 
23.303 62.675 18.235 1-00 40.92 O 
22.355 60.685 18.705 1.00 41.24 N 
23.546 60.167 19.374 1,00 42.13 C 
23.117 58.762 19,830 1.00 41.13 C 

21.980 58.403 18.942 1.00 41.77 C 

21.270 59.693 18.669 1.00 40.59 C 
24.768 60.119 18.442 1.00 43.19 C 
25.884 60.302 18.942 1.00 42.91 O 
24.567 59.901 17.138 1.00 43.80 N 
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CA 
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25.683 


ATOM 
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59.896 16.184 1.00 45.23 C 

59.400 14.780 1.00 44.59 C 

60.279 14.079 l.QO 42.32 C 

59.821 14,334 1.00 41.06 C 

59.217 15.394 1.00 38.90 O 

60.065 13.460 1.00 38-76 O 

61.263 16.095 1.00 46,56 C 

61.353 15.883 1.00 46.98 O 

62.331 16.284 1.00 48.34 N 

63.658 16.339 1.00 50.36 C 

64.769 16.186 1.00 50.50 C 

66.114 16.495 1.00 52.39 C 

66.830 17.652 1.00 53.01 C 

67.999 17.579 1.00 53.68 N 

68.052 16.368 1.00 53.34 C 

66.877 15.661 1.00 52.72 C 

66.692 14.373 1.00 52.54 C 

67,675 13.842 1-00 52,69 C 

68.832 14.575 1.00 52.71 C 

69.040 15.833 1.00 53.56 C 

63.835 17.622 1.00 51.79 C 

64.342 17.588 1.00 52.13 O 

63.388 18.743 1.00 53.62 N 

63.496 20.048 1.00 55.72 C 

62.917 21.183 1.00 55.41 C 

63.568 21.202 1.00 56.07 C 

65.083 21.258 1.00 57.43 C 

63.055 22.525 1.00 56.33 C 

62,771 20.050 1.00 57.10 C 

63.335 20.447 1.00 57.22 O 

61.524 19.591 1.00 58.27 N 

60.635 19.650 1.00 59.99 C 

59.190 19.585 1.00 60.48 C 

58.721 20.837 1.00 64.19 C 

57.237 20.844 1.00 68.93 C 

56.501 20.865 1.00 73.42 N 

55.185 21.015 1.00 76.10 C 

54.439 21,161 1.00 76.91 N 

54.609 21.013 1.00 76.26 N 

60.864 18.546 1.00 59,96 C 

60.803 18,812 1.00 60.37 O 

61.106 17.318 1.00 59.65 N 

61.161 16.159 1.00 59.46 C 

59.997 15.196 1.00 59.78 C 

58.648 15.858 1.00 61.91 C 

58.120 16.639 1.00 63.80 C 

56.877 17.247 1.00 64.01 C 

56.151 17.070 1.00 65.13 C 

54,910 17.657 1.00 66.40 O 

56.647 16.288 1.00 64.30 C 

57.885 15.692 1.00 63.31 C 

62.453 15.364 1.00 58.84 C 

62,556 14.356 1.00 58.96 O 

63.431 15.791 1.00 57.99 N 

64.666 15.020 1.00 57.44 C 

65.583 15,198 1.00 58-54 C 

66.231 16.547 1.00 62.71 C 

67.486 16.793 1.00 65.61 N 

67.796 18.065 1.00 67.49 C 

66.788 18.655 1.00 67.79 N 

65.793 17.730 1.00 65.96 C 

64.415 13.524 1.00 55.81 C 

65.135 12-653 1.00 55.97 O 

63.395 13.243 1.00 53.48 N 

63.041 11.881 1.00 51.48 C 

61-864 11-347 1.00 51,91 C 

62.132 11.121 1.00 54.97 C 

60.860 11.180 1.00 59.48 C 

60.059 9.957 1.00 63.43 N 
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7. 


.627 


ATOM 


1596 


CDl 


LEU 


A 


231 


7. 


.733 


ATOM 


1597 


CD2 


LEU 


A 


231 


7. 


.419 


ATOM 


1598 


C 


LEU 


A 


231 


7. 


.798 


ATOM 


1599 


O 


LEU 


A 


231 


6. 


. 949 


ATOM 


1600 


N 


LEU 


A 


232 


7. 


655 


ATOM 


1601 


CA 


LEU 


A 


232 


6. 


499 


ATOM 


1602 


CB 


LEU 


A 


232 


6. 


470 


ATOM 


1603 


CG 


LEU 


A 


232 


5. 


301 


ATOM 


1604 


CDl 


LEU 


A 


232 


3. 


947 


ATOM 


1605 


CD2 


LEU 


A 


232 


5. 


359 


ATOM 


1606 


C 


LEU 


A 


232 


6. 


439 


ATOM 


1607 


O 


LEU 


A 


232 


5. 


371 


ATOM 


1608 


N 


TYR 


A 


233 


7. 


571 


ATOM 


1609 


CA 


TYR 


A 


233 


7- 


646 


ATOM 


1610 


CB 


TYR 


A 


233 


9. 


068 


ATOM 


1611 


CG 


TYR 


A 


233 


9. 


209 


ATOM 


1612 


CDl 


TYR 


A 


233 


9. 


255 


ATOM 


1613 


CEl 


TYR 


A 


233 


9. 


353 


ATOM 


1614 


C2 


TYR 


A 


233 


9. 


407 


ATOM 


1615 


OH 


TYR 


A 


233 


9. 


505 


ATOM 


1616 


CE2 


TYR 


A 


233 


9. 


365 


ATOM 


1617 


CD2 


TYR 


A 


233 


9. 


267 


ATOM 


1618 


C 


TYR 


A 


233 


7. 


216 


ATOM 


1619 


0 


TYR 


A 


233 


6. 


416 


ATOM 


1620 


N 


ASP 


A 


234 


7. 


762 


ATOM 


1621 


CA 


ASP 


A 


234 


7. 


411 


ATOM 


1622 


CB 


ASP 


A 


234 


8. 


156 


ATOM 


1623 


CG 


ASP 


A 


234 


7, 


951 


ATOM 


1624 


ODl 


ASP 


A 


234 


8- 


206 



62.682 17.851 1.00 32.49 



62.479 19.173 1.00 32.64 ^ 

61.134 19.383 1.00 31.73 c 

60.485 18.148 1.00 31.24 c 
59.089 18.094 1.00 32.89 c 
58.391 19.261 1.00 32.32 c 
59.080 20.478 1.00 33.34 c 
60.444 20.552 1.00 32.43 c 
62.372 16.178 1.00 32.00 c 
62.224 17.192 1.00 31.94 o 
63.546 15.558 1.00 31.65 n 
64.752 16.073 1.00 31.57 c 
65.982 15.259 1.00 31.61 q 
66.048 14.016 1.00 33.45 o 
64.599 16.161 1.00 31.31 c 
65,151 17.072 1.00 30.56 o 
63.841 15.229 1.00 30.36 N 
63.545 15.217 1.00 30.48 C 
62.865 13.901 1.00 30.69 C 
63.767 12.661 1.00 31.45 C 
62.947 11.389 1.00 30.81 C 
64.917 12.784 1.00 30.15 C 

62.661 16.394 1.00 29.84 C 
62.815 16.958 1.00 30.50 O 
61.751 16.765 1.00 29.67 n 
60.895 17.920 1.00 29.67 C 
61.723 19.208 1,00 30.52 C 

61.486 20.105 1.00 28.98 O 
62.670 19.307 1.00 30.54 N' 
63.587 20.459 1.00 31.11 C 
64.564 20.334 1.00 31.68 C 
63.792 20.281 1.00 30.92 C 
62.992 21.546 1.00 30.66 C 
65.553 21.533 1.00 30.61 C 
64.397 20.532 1.00 31-56 C 
64,488 21.596 1.00 31.09 O 
64.928 19.373 1.00 30.97 N 
65,711 19.257 1.00 30.14 C 
66.239 17.828 1.00 30.10 C 
67.043 17.556 1.00 32.25 C 

68.372 18.310 1.00 30.47 C 
67.246 16.065 1.00 30.92 C 
64.950 19.689 1.00 30.59 C 
65.484 20,439 1.00 30.81 O 
63.721 19.210 1.00 29.58 N 
62,916 19.552 1.00 30.41 C 
61.609 18.745 1.00 30.17 C 
60.642 19.033 1.00 30.99 C 
61,346 18.919 1.00 33.55 C 
59.465 18.073 1.00 31.95 C 
62.630 21.062 1.00 30.78 C 
62,721 21.667 1.00 31.42 0 
62.272 21.650 1.00 30.02 N 
62.042 23.103 1.00 30.55 C 

61.662 23,526 1.00 30.26 C 

61.373 25.008 1.00 28.93 C 
62.416 25,930 1.00 29.15 C 
62.171 27.311 1.00 28.65 C 
60.882 27,769 1.00 31.81 C 
60.681 29.132 1.00 36.02 O 
59,801 26,878 1.00 30.59 C 
60.058 25.486 1.00 28.47 c 
63.306 23.834 1.00 31.37 C 
63.250 24,769 1.00 32-79 O 
64.434 23.407 1.00 31,52 N 
65.750 23,934 1.00 33.52 C 
66-833 23.162 1.00 34.26 C 
68.224 23.745 1.00 37.82 C 
68.450 24.956 1.00 39.31 O 



SUBSTITUTE SHEET (RULE 26) 



wo 2005/028624 



ATOM 


1625 


0D2 


ASP A 234 


7.531 


ATOM 


1626 


C 


ASP A 234 


5. 923 


ATOM 


1627 


O 


ASP A 234 


5.368 


ATOM 


1628 


N 


MET A 235 


5.258 


ATOM 


1629 


CA 


MET A 235 


3.819 


ATOM 


1630 


CB 


MET A 235 


3.293 


ATOM 


1631 


CG 


MET A 235 


3.641 


ATOM 


1632 


SD 


MET A 235 


2.965 


ATOM 


1633 


CE 


MET A 235 


4 .174 


ATOM 


1634 


C 


MET A 235 


3.020 


ATOM 


1635 


0 


MET A 235 


2.133 


ATOM 


1636 


N 


VAL A 236 


3.322' 


ATOM 


1637 


CA 


VAL A 236 


2.518 


ATOM 


1638 


CB 


VAL A 236 


2. 405 


ATOM 


1639 


CGI 


VAL A 236 


1.757 


ATOM 


1640 


CG2 


VAL A 236 


3.763 


ATOM 


1641 


C 


VAL A 236 


2.955 


ATOM 


1642 


0 


VAL A 236 


2.225 


ATOM 


1643 


N 


CYS A 237 


4 .131 


ATOM 


1644 


CA 


CYS A 237 


4.642 


ATOM 


1645 


CB 


CYS A 237 


5.972 


ATOM 


1646 


SG 


CYS A 237 


5.796 


ATOM 


1647 


C 


CYS A. 237 


4.790 


ATOM 


1648 


0 


CYS A 237 


4 .983 


ATOM 


1649 


N 


GLY A 238 


4.722 


ATOM 


1650 


CA 


GLY A 238 


4.791 


ATOM 


1651 


C 


GLY A 238 


6.186 


ATOM 


1652 


0 


GLY A 238 


6.353 


ATOM 


1653 


N 


ASP A 239 


7.198 


ATOM 


1654 


CA 


ASP A 239 


8.580 


ATOM 


1655 


CB 


ASP A 239 


9.056 


ATOM 


1656 


CG 


ASP A 239 


10.214 


ATOM 


1657 


ODl 


ASP A 239 


10.586 


ATOM 


1658 


OD2 


ASP A 239 


10.822 


ATOM 


1659 


C 


ASP A 239 


9.418 


ATOM 


1660 


0 


ASP A 239 


8.957 


ATOM 


1661 


N 


ILE A 240 


10. 630 


ATOM 


1662 


CA 


ILE A 240 


11.529 


ATOM 


1663 


CB 


ILE A 240 


12.641 


ATOM 


1664 


CGI 


ILE A 240 


13.306 


ATOM 


1665 


CDl 


ILE A 240 


14.455 


ATOM 


1666 


CG2 


ILE A 240 


12.092 


ATOM 


1667 


C 


ILE A 240 


12.106 


ATOM 


1668 


0 


ILE A 240 


12.187 


ATOM 


1669 


N 


PRO A 241 


12.470 


ATOM 


1670 


CA 


PRO A 241 


12. 941 


ATOM 


1671 


CB 


PRO A 241 


12.879 


ATOM 


1672 


CG 


PRO A 241 


13. 150 


ATOM 


1673 


CD 


PRO A 241 


12. 444 


ATOM 


1674 


C 


PRO A 241 


14.361 


ATOM 


1675 


O 


PRO A 241 


14 . 639 


ATOM 


1676 


N 


PHE A 242 


15 . 243 


ATOM 


1677 


CA 


PHE A 242 


16. 644 


ATOM 


1678 


CB 


PHE A 242 


17.589 


ATOM 


1679 


CG 


PHE A 242 


17.145 


ATOM 


1680 


CDl 


PHE A 242 


16.885 


ATOM 


1681 


CEl 


PHE A 242 


16.496 


ATOM 


1682 


CZ 


PHE A 242 


16.367 


ATOM 


1683 


CE2 


PHE A 242 


16.618 


ATOM 


1684 


CD2 


PHE A 242 


17.012 


ATOM 


1685 


C 


PHE A 242 


17.104 


ATOM 


1686 


o 


PHE A 242 


16.783 


ATOM 


1687 


N 


GLU A 243 


17.884 


ATOM 


1688 


CA 


GLU A 243 


18.514 


ATOM 


1689 


CB 


GLU A 243 


18.204 


ATOM 


1690 


CG 


GLU A 243 


16.930 


ATOM 


1691 


CD 


GLU A 243 


16.911 


ATOM 


1692 


OEl 


GLU A 243 


17.901 


ATOM 


1693 


0E2 


GLU A 243 


15.894 
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69.156 23.030 1.00 39.97 o 

66.023 23.923 1.00 34.29 C 

66.524 24.931 1.00 35.28 0 

65.695 22.810 1.00 33.03 N 

65.904 22.713 1.00 33.96 C 

65.625 21.305 1.00 33.12 C 

66.708 20.286 1.00 36.42 C 

66.246 18.692 1.00 39-53 S 

65.260 18.147 1.00 43.23 C 

65.078 23.703 1.00 33.96 C 

65.607 24.337 1.00 35.00 O 

63.787 23.816 1.00 34.07 N 

62.893 24.660 1.00 35.02 C 

61.476 24.055 1.00 35.33 C 

61.562 22.673 1.00 34.30 C 

60.805 23.937 1.00 33,11 C 

62.843 26.129 1.00 35.63 C 
62.326 26.970 1.00 35.58 O 
63.389 26.432 1.00 36.11 N 
63.383 27.814 1.00 36.98 C 
62.630 27.909 1.00 36.35 C 

60.844 27.757 1.00 38.34 S 
64.778 28.416 1.00 37.40 C 
64.907 29.628 1.00 38.17 O 
65.812 27.576 1.00 36,49 N 
67.183 28.045 1.00 36.34 C 
67.717 28.259 1.00 37 , 6^ C 
68.842 28.719 1*00 37. "75 O 
66.916 27.939 1.00 38.11 N 
67.369 28.009 1.00 39.21 C 
67.339 29.458 1.00 40.7^ C 
68.302 29.735 1.00 45.40 C 
69.135 28.867 1.00 49*01 ^ O 
68.274 30.828 1.00 50.33 ; O 
66.434 27.142 1.00 39.28 C 
65.354 26.769 1.00 39.18 O 
66.85Ci 26.809 1.00 39.54 3^ 
66.066 25.983 1.00 39.95 C 
66.954 25.440 1.00 40.83 C 
67.740 26.578 1.00 41.83 C 
68.627 26.125 1.00 44.95 C 

67.911 24.344 1.00 39.77 C 
64.916 26.827 1.00 40.26 C 
65.046 28.049 1.00 40.89 O 
63.791 26.210 1.00 40.14 N 
62.620 26.971 1.00 41.03 C 
61.494 25.932 1.00 40.79 C 
62.187 24.622 1.00 39.44 C 
63.518 24.757 1.00 39,54 C 
62.737 27.548 1.00 42.89 C 
62.109 28.571 1.00 42.98 O 
63.508 26.912 1.00 44.80 N 
63.555 27.340 1.00 46.51 C 
62.944 26.285 1.00 45.41 C 
61.617 25.735 1-00 43.12 C 
60.545 26.578 1.00 42.42 C 
59.313 26-068 1.00 41.08 C 
59.148 24.676 1-00 43.10 C 
60.222 23.824 1.00 40.87 C 
61.438 24.350 1.00 42.39 C 
64.973 27.639 1.00 48.94 C 
65.913 26.903 1.00 48.57 O 
65-112 28.714 1.00 52.57 

66.391 29.046 1.00 55.94 C 

66.793 30.496 1.00 57.25 C 

67.634 30.664 1.00 62.51 C 

68.912 29.814 1.00 68.14 C 
69.697 29.854 1.00 69.83 O 
69.140 29.104 1.00 69.55 O 
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ATOM 


1694 


C 


GLU A 24 3 


20. 


022 


ATOM 


1695 


o 


GLU A 24 3 


20. 


596 


ATOM 


1696 


N 


HIS A 244 


20. 


654 


ATOM 


1697 


CA 


HIS A 244 


22. 


111 


ATOM 


1698 


CB 


HIS A 24 4 


22. 


634 


ATOM 


1699 


CG 


HIS A 244 


22. 


177 


ATOM 


1700 


NDl 


HIS A 244 


21. 


243 


ATOM 


1701 


CEl 


HIS A 244 


21. 


021 


ATOM 


1702 


NE2 


HIS A 244 


21. 


772 


ATOM 


1703 


CD2 


HIS A 244 


22. 


501 


ATOM 


1704 


C 


HIS A 244 


22. 


632 


ATOM 


1705 


o 


HIS A 244 


21. 


946 


ATOM 


1706 


N 


ASP A 245 


23. 


850 


ATOM 


1707 


CA 


ASP A 245 


24. 


536 


ATOM 


1708 


CB 


ASP A 245 


25. 


982 


ATOM 


1709 


CG 


ASP A 245 


26. 


093 


ATOM 


1710 


ODl 


ASP A 245 


25. 


109 


ATOM 


1711 


OD2 


ASP A 245 


27. 


132 


ATOM 


17X2 


C 


ASP A 245 


24. 


520 


ATOM 


1713 


0 


ASP A 245 


24 . 


240 


ATOM 


1714 


N 


GLU A 24 6 


24. 


807 


ATOM 


1715 


CA 


GLU A 246 


24. 


,814 


ATOM 


1716 


CB 


GLO A 24 6 


25. 


.227 


ATOM 


1717 


CG 


GLU A 246 


26. 


247 


ATOM 


1718 


CD 


GLU A 24 6 


25. 


.601 


ATOM 


1719 


OEl 


GLU A 24 6 


24, 


.864 


ATOM 


1720 


0E2 


GLU A 24 6 


■ 25. 


,824 


ATOM 


1721 


C 


GLU A 24 6 


23. 


.464 


ATOM 


1722 


O 


GLU A 24 6 


23. 


.405 


ATOM 


1723 


N 


GLU A 247 


22. 


.381 


ATOM 


1724 


CA 


GLU A 247 


21. 


.037 


ATOM 


1725 


CB 


GLU A 24 7 


19. 


.982 


ATOM 


1726 


CG 


GLU A 247 


20. 


.048 


ATOM 


1727 


CD 


GLU A 247 


19, 


.070 


ATOM 


1728 


OEl 


GLU A 247 


19, 


.189 


ATOM 


1729 


OE2 


GLU A 247 


18, 


.172 


ATOM 


1730 


C 


GLU A 247 


20 


.734 


ATOM 


1731 


O 


GLU A 247 


20 


.177 


ATOM 


1732 


N 


ILE A 248 


21 


.102 


ATOM 


1733 


CA 


ILE A 248 


20 


.964 


ATOM 


1734 


CB 


ILE A 248 


21 


.446 


ATOM 


1735 


CGI 


ILE A 248 


20 


.599 


ATOM 


1736 


CDl 


ILE A 248 


21 


.110 


ATOM 


1737 


CG2 


ILE A 248 


21 


.444 


ATOM 


1738 


C 


ILE A 248 


21 


.741 


ATOM 


1739 


O 


ILE A 248 


21 


.221 


ATOM 


1740 


N 


ILE A 24 9 


22 


. 977 


ATOM 


1741 


CA 


ILE A 24 9 


23 


.845 


ATOM 


1742 


CB 


ILE A 249 


25 


.315 


ATOM 


1743 


CGI .ILE A 24 9 


25 


.882 


ATOM 


1744 


CDl 


ILE A 24 9 


27 


.162 


ATOM 


1745 


CG2 


ILE A 249 


26 


.208 


ATOM 


1746 


C 


ILE A 249 


23 


.344 


ATOM 


1747 


O 


ILE A 249 


23 


.473 


ATOM 


1748 


N 


ARG A 250 


22 


.798 


ATOM 


1749 


CA 


ARG A 250 


22 


,259 


ATOM 


1750 


CB 


ARG A 250 


22 


.052 


ATOM 


1751 


CG 


ARG A 250 


21 


.612 


ATOM 


1752 


CD 


ARG A 250 


21 


.702 


ATOM 


1753 


NE 


ARG A 250 


21 


.290 


ATOM 


1754 


CZ 


ARG A 250 


20 


.217 


ATOM 


1755 


NHl 


. ARG A 250 


19 


.412 


ATOM 


1756 


NH2 


'. ARG A 250 


19 


.955 


ATOM 


1757 


C 


ARG A 250 


20 


. 949 


ATOM 


1758 


0 


ARG A 250 


20 


.617 


ATOM 


1759 


N 


GLY A 251 


20 


.224 


ATOM 


1760 


CA 


GLY A 251 


18 


.982 


ATOM 


17 61 


C 


GLY A 251 


17 


.855 


ATOM 


17 62 


O 


GLY A 251 


16 


.936 



66.364 28,813 1.00 56.70 C 

67.329 28.291 1.00 57.61 O 

65.250 29.169 1.00 57.00 N 
65.145 29.128 1.00 57.57 C 
64.653 30.484 1.00 57.93 C 
65.491 31.641 1.00 60.15 C 
65.040 32.563 1.00 61.46 N 
65.986 33.459 1.00 61.53 C 
67.040 33.145 1.00 61.93 N 
66.761 32.008 1.00 60.89 C 

64.251 27.999 1.00 57.12 C 

63.321 27.564 1.00 56.42 O 

64.550 27.542 1.00 56.65 N 

63.778 26.508 1.00 56.51 C 
64.254 26.364 1.00 56.84 C 

65.551 25.602 1.00 58.28 C 

66.322 25.555 1.00 60.57 O 
65.889 25.003 1.00 61.68 O 
62.289 26,792 1.00 55.85 C 
61.487 25,902 1,00 55.84 O 
61.926 28.038 1.00 55.12 N 
60.528 28.473 1.00 54.57 C 

60.414 29.948 1.00 55.49 C 
61.439 30.419 1.00 59.92 C 
62.745 30.853 1.00 64.57 C 
62.732 31.873 1.00 66.43 O 

63.779 30.165 1.00 66.00 O 
59.838 28.284 1.00 52.76 C 
58.637 27.998 1.00 51.94 O 
60.583 28.491 1.00 51.08 ' N 
60.031 28.289 1.00 50.00 C 
60-949 28.888 1.00 50.91 C 
61.069 30.398 1.00 54.76 C 
62.089 30.919 1.00 59.14 C, 
63.281 30.568 1.00 61.88 O 
61.693 31.672 1.00 63.68 O. 
59.785 26.810 1.00 47.56 C 
58.757 26.463 1.00 46.72 O 
60.738 25.957 1.00 46.28 N 
60.598 24.498 1.00 46.12 C 
61.876 23.754 1.00 45.90 C 
63.092 24.141 1-00 44.91 C 
64.419 23.588 1.00 44.29 C 
61.658 22.233 1.00 45.48 C 
59.390 23.988 1.00 46.47 C 
58.613 23.199 1.00 46.50 O 
59.223 24.462 1.00 46.70 N 
58.119 24.021 1.00 47.73 C 
58.342 24.516 1.00 48.27 C 
59.634 23.929 1.00 50.01 C 
60.114 24.638 1.00 54.07 C 
57.167 24.127 1.00 49.80 C 
56.752 24.463 1.00 47.21 C 
55.754 23.735 1.00 47.14 O 
56.697 25.671 1.00 46.59 N 
55.454 26.197 1.00 46.70 C 
55.573 27.712 1.00 46.73 C 
54.297 28.415 1.00 47.47 C 

54.415 29.942 1.00 49-11 • C 
53.191 30.631 1.00 50.87 N 
53.076 31.429 1.00 50.66 C 
54.117 31.656 1.00 46.65 N 
51.909 32.006 1.00 50.26 N 
55.097 25.483 1.00 46.42 C 
53.922 25.352 1.00 47.00 O 
56.113 25.018 1.00 46.84 N 
55.936 24.269 1.00 47.26 C 
55.180 24.968 1.00 47.36 C 
54.702 24.318 1.00 47.71 .0 



SUBSTITUTE SHEET (RULE 26) 



wo 2005/028624 



PCT/US2004/030360 



ATOM 


1763 


N 


GLN A 252 


17 . 921 


ATOM 


1764 


CA 


GLN A 252 


16 . 872 


ATOM 


1765 


CB 


GLN A 252 


17 . 438 


ATOM 


1766 


CG 


GLN A 252 


16 . 745 


ATOM 


1767 


CD 


GLN A 252 


17 .362 


ATOM 


1768 


OEl 


GLN A 252 


16 . 922 


ATOM 


1769 


NE2 


GLN A 252 


18 .381 


ATOM 


1770 


C 


GIjN a 252 


15 . 720 


ATOM 


1771 


O 


GLN A 252 


15 . 914 


ATOM 


1772 


N 


VAL a 253 


14 .534 


ATOM 


1773 


CA 


VAL A 253 


13 .366 


ATOM 


1774 


CB 


VAL A 253 


12.515 


ATOM 


1775 


CGI 


VAL A 253 


11.398 


ATOM 


1776 


CG2 


VAL A 253 


13 .386 


ATOM 


1777 


C 


VAL A 253 


12 .452 


ATOM 


1778 


O 


VAL A 253 


11 . 870 


ATOM 


1779 


N 


PHE A 254 


12.312 


ATOM 


1780 


CA 


PHE A 2 54 


11.415 


ATOM 


1781 


CB 


PHE A 2 54 


12.125 


ATOM 


1782 


CG 


PHE A 254 


11.181 


ATOM 


1783 


GDI 


PHE A 254 


10.698 


ATOM 


1784 


CEl 


PHE A 254 


9 . 794 


ATOM 


1785 


CZ 


PHE A 254 


9 .368 


ATOM 


1786 


CE2 


PHE A 254 


9 . 840 


ATOM 


1787 


CD2 


PHE A 254 


10 . 742 


ATOM 


1788 


C 


PHE A 254 


10 . 192 


ATOM 


1789 


O 


PHE A 254 


10.324 


ATOM 


1790 


N 


PHE A 255 


9 . Oil 


ATOM 


1791 


CA 
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55.067 26.290 1.00 46.77 N 

54.400 27.058 1,00 46.61 C 

53.965 28,410 1.00 47.77 C 

52.797 29.034 1.00 53.27 C 

51.495 28.593 1.00 58.82 C 

50.902 27.587 1.00 62.58 O 

51.040 29.328 1.00 59.71 N 

55.388 27.264 1.00 44,36 C 

56.458 27.842 1.00 43.90 O 

55.036 26.789 1.00 42.26 N 

55.917 26.887 1.00 41.30 C 

55.900 25.574 1.00 40.85 C 
56.917 25.657 1.00 41.13 C 
56.184 24,330 1.00 41.05 C 
55,558 28.080 1.00 40.21 C 
54.475 28.128 1.00 39.18 O 
56.494 29.004 1.00 40,53 N 
56.340 30.147 1.00 41.30 C 
56.749 31,446 1.00 42.46 C 
56.979 32.597 1.00 45,89 C 
55.890 33.354 1,00 48.15 C 
56.087 34.453 1.00 46.65 C 
57.391 34.762 1,00 47.61 C 
58.499 33.990 1.00 48.28 C 
58.287 32.922 1.00 47.87 C 
57.216 29.960 1.00 40.77 C 
58.388 29.630 1.00 40.62 O 
56.656 30.202 1.00 39.76 N 
57.377 30.041 1.00 40.05 C 
56.512 29.293 1.00 38.87 C 
56.408 27.844 1.00 38.12 C 
57.332 26,945 1.00 37.51 C 
57.267 25.588 1.00 37.08 C 
56.277 25.126 1.00 38.31 C 
55.353 26.034 1.00 37.42 C 
55.429 27.379 1.00 36.49 C 

57.901 31.355 1.00 40.75 C 
57.139 32.280 1.00 41.63 O 
59.214 31.414 1.00 42.10 N 
59.911 32.614 1.00 43.68 C 
61.291 32.722 1.00 44.33 C 
62.233 31.549 1.00 46.48 C 
63.508 31.606 1.00 49.15 C 
63.277 31.034 1.00 53.26 N 
64.018 31.293 1.00 54.62 C 

65.060 32.133 1.00 55.57 N 
63.720 30.716 1.00 52.61 N 
60.051 32.658 1.00 43.59 C 
60.251 33.724 1.00 44.72 O 
59.930 31.498 1.00 42.63 N 
59.949 31.403 1.00 41.06 C 

61.061 30.445 1.00 42.46 C 
62.451 30.901 1.00 46.35 C 
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63.504 33.020 1.00 55.05 N 
58.618 30.872 1.00 38.68 C 
57.849 30.300 1.00 38.45 O 
58.353 31.047 1.00 36.14 N 
57,177 30.479 1.00 36.06 C 
57.048 30.997 1.00 34.77 C 
55.659 30.847 1.00 39.04 C 
55.230 29.442 1.00 40.99 C 
53.789 29.300 1.00 40.39 N 
53.219 28.215 1.00 43.89 C 
53.966 27.158 1.00 45.06 N 
51.899 28.162 1.00 40.81 N 
57.345 28.950 1.00 35.62 C 
58.325 28.463 1.00 35.32 O 
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54.339 8.723 1.00 59.20 O 

52.113 8.708 1.00 62.36 N 

56.329 12.001 1.00 45.68 C 

56.708 13.177 1.00 45.87 O 
56.590 11.100 1.00 46.07 N 
57.331 11.410 1.00 46.57 C 
56.556 12.391 1.00 47.72 C 
55.099 12.018 1.00 52.03 C 
54.836 10.858 1.00 54.96 O 
54.164 12.805 1.00 56.19 O 
58.739 11.961 1.00 45.48 C 
59.181 12.880 1.00 44.34 O 
59.433 11.424 1.00 44.87 N 
60.787 11.882 1.00 44.61 C 
61.338 11.181 1.00 44.85 C 
61.662 9.735 1.00 44.77 C 
62.568 11.896 1.00 43.11 C 

61.709 11.640 1.00 44.94 C 
61.532 10.653 1.00 45.74 O 
62.668 12.538 1.00 44.60 N 
63.650 12.354 1.00 44.83 C 
64.416 13.641 1.00 43.88 C 
63.721 14.886 1.00 44.35 C 
64.760 15.972 1.00 42.61 C 
63.036 14.632 1.00 45.06 C 
64.655 11.312 1.00 45.35 C 
64.888 11.142 1.00 45.48 O 
65.266 10.629 1.00 46.10 N 
66.358 9.712 1.00 46.34 C 
66.586 8.790 1.00 47.42 * C 
65.968 7.380 1.00 50.24 C 
64.460 7.415 1.00 51.66 C 
66.288 6.595 1.00 54.37 C 
67.612 10.527 1.00 46.02 C 
67.723 11.693 1.00 44.80 O. 
68.544 9.955 1.00 46.46 N 
69.790 10.658 1.00 47.09 C " 
70.633 9.561 1.00 46.93 C 
69.609 8.741 1.00 45.84 C 
68.496 8.638 1.00 46.93 C 
70.465 11.266 1.00 48.28 C 
70.739 12.474 1.00 47.68 O 
70.684 10.480 1.00 50.01 N 
71.328 11.022 1.00 51.78 C 
71.636 9.933 1.00 52.87 C 
72.781 10.298 1.00 55.96 C 
74.122 10.632 1.00 60.05 C 
74.739 9.765 1.00 62.29 O 
74.570 11.884 1.00 60.16 N 
70.550 12.181 1.00 51.71 C 
71.140 13.201 1.00 52.21 O 
69.234 12.034 1.00 51.63 N 
68.391 13.108 1.00 51.67 C 
66,922 12.736 1.00 52.61 C 
66.406 11.732 1.00 56.45 C 
64.908 11.552 1.00 60.37 C 
64.480 10.874 1.00 61.21 O 
64.160 12.112 1.00 63.66 O 
68.586 14.372 1.00 50.78 C 
68.666 15.490 1-00 50.03 O 
68.624 14.186 1.00 49.28 N 
68.812 15.291 1.00 48.32 C 
68.855 14.781 1.00 48.02 C 
67.636 14.096 1.00 45.02 O 
68.910 15.951 1.00 47.18 C 
70.095 16.040 1.00 48.76 C 
70.101 17.268 1.00 47.62 O 
71.181 15.293 1.00 49.46 N 
72.468 15.887 1.00 50.94 C 
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Table 4 



HEADER 
COMPND 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMTWEIK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 
REMARK 



XX~XXX~XX xxxx 



REFINEMENT. 
PROGRAM 
AUTHORS 



REEmC 5.1.21 
MURSHUDOV, VAGIN, DODSON 



REFINEMENT TARGET : MAXIMUM LIKELIHOOD 



DATA USED IN REFINEMENT. 
RESOLUTION RANGE HIGH (ANGSTROMS) 
RESOLUTION RANGE LOW (ANGSTROMS) 
DATA CUTOFF ( SIGMA ( F) ) 

COMPLETENESS FOR RANGE (%} 
NUMBER OP REFLECTIONS 

FIT TO DATA USED IN REFINEMENT. 
CROSS-VALIDATION METHOD 
FREE R VALUE TEST SET SELECTION 
R VALUE (WORKING + TEST SET) 

R VALUE (WORKING SET) 

FREE R VALUE 

FREE R VALUE TEST SET SIZE (%) 
FREE R VALUE TEST SET COUNT 



2.03 
81.65 
NONE 
99.61 

25766 



THROUGHOUT 

RANDOM 

0.19077 

0.18920 , 

0.22121 

5.0 

1368 



FIT IN THE HIGHEST RESOLUTION BIN. 
TOTAL NUMBER OF BINS USED 
BIN RESOLUTION RANGE HIGH 
BIN RESOLUTION RANGE LOW 
REFLECTION IN BIN (WORKING SET) 

BIN R VALUE (WORKING SET) 

BIN FREE R VALUE SET COUNT 
BIN FREE R VALUE 





20 


2 


.030 


2 


.083 




1894 


0 


.289 




113 


0 


.297 



NUMBER OF NON-HYDROGEN ATOMS 
ALL ATOMS 



USED IN 
2400 



REFINEMENT. 



B VALUES. 
FROM WILSON PLOT (A**2) 
MEAN B VALUE (OVERALL, A**2) 

OVERALL ANISOTROPIC B VALCE. 



NULL 
27.297 



Bll 
B22 
B33 

B12 
B13 
B23 



(A**2) 

(A**2) 
(A**2) 
{A**2) 
(A**2) 
(A**2) 



0.50 
0.50 
-0.74 



,25 
.00 
,00 



ESTIMATED OVERALL COORDINATE ERROR. 
ESU BASED ON R VALUE 
ESU BASED ON FREE R VALUE 
ESU BASED ON MAXIMUM LIKELIHOOD 



(A) 
(A) 
(A) 



ESU FOR B VALUES BASED ON MAXIMUM LIKELIHOOD (A**2) 

CORRELATION COEFFICIENTS. 

CORRELATION COEFFICIENT FO-FC ; 0.959 

CORRELATION COEFFICIENT FO-FC FREE : 0.94 6 

RMS DEVIATIONS FROM IDEAL VALUES COUNT RMS 

BOND LENGTHS REFINED ATOMS (A): 2334 ; 0.011 

BOND ANGLES REFINED ATOMS (DEGREES): 3174 ; 1.105 



0.151 
0-140 
0.105 
3-960 



WEIGHT 

0.021 

1.959 
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REMARK 

REMARK 

REMARK 

REMARK 

REMARK 

REMARK 

REMARK 

REMARK 

REMARK 

REMARK 

REMARK 

REMARK 

REMARK 

REMARK 

REMARK 

REMARK 

REMARK 

REMARK 

REMARK 

REMARK 

REMARK 

REMARK 

REMARK 

REMARK 

REMARK 

REMARK 

REMARK 

REMARK 

REMARK 

REMARK 

REMARK 

REMARK 

REMARK 

REMARK 

REMARK 

REMARK 

REMARK 

REMARK 

REMARK 

REMARK 

REMARK 

REMARK 

REMARK 

REMARK 

REMARK 

REMARK 

REMARK 

REMARK 

REMARK 

REMARK 

REMARK 

REMARK 

REMARK 

REMARK 

REMARK 

REMARK 

REMARK 

REMARK 

REMARK 

REMARK 

REMARK 

REMARK 

REMARK 

REMARK 

REMARK 

REMARK 

REMARK 

CISPEP 

CRYSTl 



3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
• 3 
3 
3 
3 
3 
3 
3 
3 
3 
1 



TORSION ANGLES, PERIOD 1 (DEGREES) ; 

CHIRAL-CENTER RESTRAINTS (A** 3) ; 

GENERAL PLANES REFINED ATOMS (A) : 

NON-BONDED CONTACTS REFINED ATOMS (A) ; 
H~BOND (X. . .Y) REFINED ATOMS (A) ; 

SYMMETRY VOW REFINED ATOMS (A) 
SYMMETRY H-BOND REFINED ATOMS (A) 



{A**2) 
(A** 2) 
(A**2) 



ISOTROPIC THERMAL FACTOR RESTRAINTS. 
MAIN-CHAIN BOND REFINED ATOMS (A**2) 
MAIN-CHAIN ANGLE REFINED ATOMS 
SIDE-CHAIN BOND REFINED ATOMS 
SIDE-CHAIN ANGLE REFINED ATOMS 

NCS RESTRAINTS STATISTICS 
NUMBER OF NCS GROUPS : NULL 



TLS DETAILS 
NUMBER OF TLS GROUPS : 2 

TLS GROUP : 1 

NUMBER OF COMPONENTS GROUP : 1 
COMPONENTS C SSSEQI TO C 
RESIDUE RANGE : A 33 A 

ORIGIN FOR THE GROUP (A): 65.5800 



273 




5.228 


/ 


5.000 


336 




0. 080 


r 


0.200 


1800 


/ 


0. 004 


} 


0.020 


1070 




0 . 202 


r 


0. 200 


150 


/ 


0 . 145 


r 


0.200 


46 




0. 199 


r 


0. 200 


10 




0.267 


7 


0.200 


COUNT 


RMS 




WEIGHT 


1365 


/ 


0.799 


t 


1.500 


2214 


/ 


1.519 


f 


2.000 


969 




2.024 


t 


3.000 


960 




3.247 


r 


4.500 



SSSEQI 
306 
27.1270 



T 


TENSOR 
















Til: 


0. 


1410 


T22: 


0. 


1266 






T33: 


0. 


0824 


T12: 


-0. 


0364 






T13: - 


0. 


0112 


T23: 


-0. 


0301 




L 


TENSOR 
















Lll: 


1. 


3945 


L22- 


0. 


7253 






L33: 


0. 


8680 


L12 


0. 


1248 






L13: - 


-0. 


3386 


L23 


0. 


0070 




S 


TENSOR 
















Sll: - 


-0. 


0568 


S12 


: 0. 


0858 


S13 




S21: - 


'0. 


0201 


S22 


: 0. 


1089 


S23 




S31: 


0. 


0298 


S32 


; 0. 


0689 


S33 


TLS GROUP 






2 









NUMBER OF COMPONENTS GROUP 
COMPONENTS C SSSEQI 



TO 



RESIDUE RANGE : L 

ORIGIN FOR THE GROUP 

T TENSOR 

Til: 0.1174 T22: 
T33: 0.1391 T12: 
T13: -0.0486 T23: 

L TENSOR 

Lll: 14.6629 L22: 
L33: 8.9109 Ll2: 
L13: -1.8290 L23: 

S TENSOR 



(A) 



1 

C 
L 

73-8810 



0.0787 
0.0287 
-0.0421 



SSSEQI 
1 

32.6080 



Sll; 
S21: 
S31: 



0.2483 S12: 
0.0302 S22: 
-0.4688 S32: 



0. 


1943 




-0. 


1003 




-0. 


0722 




15. 


7148 




11. 


1563 




15. 


1157 




0. 


1966 


S13 


0. 


1725 


S23 


0. 


8689 


S33 



0.0403 
0.8712 
-0.4208 



-0.6960 



1.3720 



BULK SOLVENT MODELLING, 
METHOD USED : BABINET MODEL WITH MASK 
PARAMETERS FOR MASK CALCULATION 



VDW PROBE RADIUS 
ION PROBE RADIUS 
SHRINKAGE RADIUS 



,40 
.80 
.80 



OTHER REFINEMENT REMARKS: NULL 



GLU A 
95.566 



124 PRO A 125 
95.566 80.862 90.00 



0. 

90.00 120.00 P 



00 
65 
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SCALEl 

SCALE2 

SCALE3 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 



1 
2 

3 

5 
6 
7 
8 
9 

10 

11 

12 

13 

14 

15 

16 

17 

18 

19 

20 

21 

22 

23 

24 

25 

26 

27 

28 

29 

30 

31 

32 

33 

34 

35 

36 

37 

38 

39 

40 

41 

42 

43 

44 

45 

46 

47 

48 

49 

50 

51 

52 

53 

54 

55 

56 

57 

58 

59 

60 

61 

62 

63 

64 

65 

66 



0,010464 0. 
0.000000 0. 
0.000000 0. 

N PRO A 
CA PRO A 
PRO A 
PRO A 
PRO A 
PRO A 
PRO A 
LEU A 
LEU A 
LEU A 
LEU A 
A 
A 
A 
A 
A 
A 



CB 
CG 
CD 
C 
O 
N 

CA 
CB 
CG 
CDl LEU 
CD2 LEU 



C 
O 
N 

CA 
CB 
CG 
CD 



LEU 
LEU 
GLU 
GLU 
GLU A 
GLU A 
GLU 



OEl GLU 
0E2 GLU 



GLU 
GLU 
SER 
SER 
SER 
SER 
SER 
SER 
GLN 
GLN A 
GLN A 
GLN A 



C 
O 
N 
CA 
CB 
OG 
C 

o 

N 
CA 
CB 
CG 

CD GLN A 
OEl GLN A 
GLN A 
GLN A 
GLN A 
TYR 
TYR 
TYR 
TYR 



NE2 
C 
O 
N 

CA 
CB 
CG 



CDl TYR 
CEl TYR 
CZ TYR 
OH TYR 
CE2 TYR 
CD2 TYR 



C 

O 

N 

CA 

CB 

CG 

CD 



TYR 
TYR A 
GLN A 
GLN 
GLN 
GLN 
GLN 



OEl GLN 
NE2 GLN 
C 
O 
N 

CA 
CB 



A 
A 
A 
A 
A 
A 

GLN A 
GLN A 
VAL 
VAL 
VAL 



CGI VAL 
CG2 VAL 
C VAL 



006041 


0 . 000000 




0.00000 






012083 


0 . 000000 




0 . 00000 






000000 


0 . 012367 




0.00000 






33 


89.14 9 


40.408 - 


-18 . 445 


1.00 67.30 


N 


33 


88 . 47 6 


41 . 476 ■ 


-17 . 647 


1.00 67.14 


C 


33 


86. 997 


41.088 ■ 


-17 .742 


1.00 67.23 


C 


33 


86. 877 


40.393 ■ 


-19.088 


1.00 67.40 


C 


33 


88 .243 


39.825 • 


-19. 451 


1.00 67.35 


C 


33 


88 . 938 


41.544 • 


-16.180 


1.00 66.89 


c 


33 


89. 154 


42.657 ■ 


-15. 690 


1.00 67.00 


0 


34 


89 . 091 


40.388 ■ 


-15.519 


1.00 66.32 


N 


34 


89,4 99 


40.274 ■ 


-14.100 


1.00 65.68 


C 


34 


90.888 


40.895 


-13.835 


1.00 65.85 


C 


34 


91 . 302 


41 .230 


-12 . 390 


1.00 66.30 


C 


34 


91-714 


39- 982 


-11.600 


1.00 66.80 


c 


34 


92 . 418 


42.268 


-12.376 


1,00 67.23 


c 


34 


88 .454 


40.795 


-13.100 


1.00 64.94 


c 


34 


87 . 873 


41.869 


-13.284 


1.00 64.93 


0 


35 


88.242 


40 .036 


-12.027 


1.00 63.77 


N 


35 


87 . 186 


40.343 


-11. 061 


1,00 62.65 


C 


35 


86.798 


39.087 


-10.270 


1.00 63.13 


C 


35 


87 . 856 


38 .599 


-9.297 


1.00 65.02 


c 


35 


87.245 


37 . 871 


-8.122 


1.00 67.48 


c 


35 


87 . 017 


38 . 518 


-7 . 069 


1.00 68.53 


o 




86 . 987 


36 . 654 


-8.260 


1.00 68.44 


o 


35 


87.444 


41 . 549 


-10. 130 


1.00 61.12 


c 




86. 859 


41 . 645 


-9.049 


1.00 61.05 


o 


o o 


88,299 


42 . 477 


-10 . 561 


1.00 59.16 


N 


^ V 


88 . 360 


43 .797 


-9. 923 


1.00 56,76 


c 




89 • 7 67 


44 . 375 


-9. 977 


1.00 57.09 


c 




90 . 185 


44 . 711 


-8. 665 


1.00 57.93 


o 


J u 


87 - 321 


44 .757 


-10.537 


1.00 54.68 


c 




87 . 465 


45. 987 


-10.482 


1.00 54.42 


o 


o / 


86 . 278 


44 . 159 


-11-121 


1.00 51.69 


N 


o / 


B5 055 


44 . 844 


-11.531 


1.00 48.71 


C 


J / 


84 .2 88 


44 . 000 


-12.556 


1.00 48-70 


C 


T7 


85 . 032 


43 . 705 


-13.853 


1.00 48-89 


C 


"^7 


84 . 357 


42.614 


-14 . 681 


1.00 48.93 


C 


■^7 


83 . 235 


42 .790 


-15.159 


1.00 48.57 


o 


'^7 


85 . 041 


41.490 


-14 . 849 


1.00 49.23 


N 


"^7 


84 . 159 


45. 068 


-10.309 


1.00 46.45 


c 


"^7 


83 . 061 


45. 621 


-10.425 


1.00 45.80 


0 


•J o 


84 . 634 


44 . 634 


-9. 142 


1.00 43.79 


N 


38 


83 . 815 


44.599 


-7.935 


1.00 41.47 


c 


38 


83 . 277 


43.178 


-7. 683 


1,00 40.80 


c 


38 


82.415 


42.680 


-8.813 


1.00 37.71 


c 


38 


81.078 


43.060 


-8.913 


1.00 36.03 


c 


38 


80 . 283 


42.622 


-9.970 


1.00 34.69 


c 


38 


80 . 834 


41.799 


-10.940 


1.00 33.35 


c 


38 


80. 058 


41.362 


-11.982 


1.00 33.33 


0 


38 


82 . 156 


41.409 


-10.861 
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40 
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c 


40 
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c 


40 


83.794 
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1.00 33.17 


• c 
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83.773 
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A 


51 


81.640 


ATOM 


134 
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VAL 


A 


52 


80:i50 


ATOM 


135 


CA 


VAL 


A 


52 
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45.296 -0.021 1.00 32.88 O 

44.047 0.094 1.00 31.35 N 

44. SOB 1.407 1.00 29,70 C 

43.395 2.438 1.00 28.50 C 

42.480 2.315 1.00 28.62 O 

43,470 3.446 1.00 27.60 N 

42.494 4.543 1.00 27.00 C 

43.073 5-501 1,00 27.10 C 

44.506 5.089 1.00 27.33 C 

44.521 3.618 1,00 27.63 C 

41.091 4.127 1-00 26.95 C 

40.921 3.149 1.00 26.22 O 

40.114 4.905 1.00 26.40 N 

38.726 4.827 1.00 26.26 C 

37.888 5.737 1.00 26.16 C 

36.367 5.836 1.00 26.71 C 

35.666 4.486 1.00 25.17 C 

35.790 6.911 1.00 25.63 C 

38.592 5.292 1.00 26.23 C 

39.010 6.409 1.00 25.53 O 

37.998 4.439 1.00 25.89 N 

37.786 4.746 1.00 26.23 C 

37.994 3.500 1.00 25.73 C 

39.383 2.881 1.00 25.94 C 

39.381 1.516 1.00 24.87 C 

40.399 3.846 1.00 26.24 C 

36.416 5.345 1.00 26.74 C 

36.274 6.148 1.00 26.63 O 

35.409 4.946 1.00 27.42 N 
34.056 5.434 1.00 28.81 C 
33.077 4.932 1.00 29.90 C 
33 „ 254 3.854 1.00 29.36 O 
32,061 5.745 1.00 31.14 N 
30.964 5.385 1.00 33.14 C 
31. 82 5.786 1.00 33.24 C 
31. a7 7.161 1.00 31.40 O 
29. '.>3 6.104 1.00 34.64 C 
29.7;.7 6.594 1.00 35.16 O 
28.G^5 6.168 1.00 36.14 N 
27.513 6.960 1.00 37.77 C 

26.410 6.202 1.00 38.06 C 
25.261 6.665 1.00 38.85 O 
26.756 5.058 1-00 38.13 N 
25.765 4.072 1.00 37.47 C 
25.489 3.163 1.00 37-20 C 
25.459 3.631 1.00 37.27 O 
25.324 1.865 1.00 36.60 N 
25.099 0.905 1.00 36.12 C 
24.532 -0.421 1.00 36.73 C 
23.153 -0.310 1.00 39.10 C 
22.960 -0.454 1.00 40.06 C 
21.681 -0.357 1.00 42.14' C 
20.575 -0.105 1.00 43.04 C 
20.755 0.045 1.00 43.48 C 
22.043 -0.061 1.00 42.11 C 
26.339 0.614 1.00 34.83 C 
26.195 0.286 1.00 35.14 O 
27.536 0.717 1.00 32.72 N 
28.772 0.303 1.00 30.18 C 
29.895 1.335 1.00 28.53 C 
29.832 2.367 1.00 28,21 O 
30.895 1.061 1.00 26.40 N 
32.156 1.803 1.00 23.83 C 
32.509 2.190 1.00 24.19 C 
31.474 2.937 1.00 23.82 O 
33.245 0.888 1.00 22.65 C 
33.423 -0.242 1.00 21.83 O 
33.948 1.369 1.00 20.93 N 
34.917 0.568 1.00 20.09 C 
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34.574 0.518 1.00 19.63 C 

35.536 -0.391 1.00 19.68 C 

33.133 0.035 1.00 19.27 C 

36.330 1.119 1.00 20.59 C 

36.576 2.309 1,00 19.74 O 

37.241 0.250 1.00 20.61 N 

38.624 0.640 1.00 21.62 C 

39.034 0.247 1,00 21.19 C 

38.256 0.922 1,00 22.16 C 

36.980 0.470 1.00 21.94 C 

36.265 1.078 1.00 22.80 C 

36.830 2.146 1.00 23.82 C 

36.114 2.736 1.00 23.97 O 

38.099 2.614 1.00 23.07 C 

38.810 1.994 1.00 23.01 C 

39.597 -0.024 1.00 22.34 C 

39.373 -1.153 1.00 22.35 O 

40.681 0.680 1.00 23.48 N 

41.822 0.074 1.00 24.82 C 

42.840 1.140 1.00 25.09 C 

43.939 0.545 1.00 25.39 O 

42.461 -0.889 1.00 25.68 C 

42.513 -0.616 1.00 25.83 O 

42.932 -2.023 1 . 0 0.,,^2 6 . 75 N 

43.609 -2.991 1.00^^17.76 C 

4 4.633 -3.827 1.00 28V87 C 

44.772 -3.737 1.00 28.62'^*- O 
45.358 -4.641 1.00 30.03, N 
46.346 -5.557 1.00 31.52 ' C 

47.773 -4.953 1.00 31.73 C 
47.955 -3.790 1.00 32.00 C 
48.992 -2.750 1.00 34.29 C 
48.830 -6.014 1.00 32.25 C 
46.274 -6.901 1.00 32.27 C 
46.245 -6.956 1.00 32.20- O 
46.225 -7,976 1.00 33.45 N 

46.308 -9,332 1.00 34.77 .C 
45.902 -10.319 1.00 34.53 C 
45.541 -11.692 1.00 35.85 C 
45.746 -12.766 1.00 37.31 C 
44.488 -13.210 1.00 38.72 N 
44.334 -13.580 1.00 39.18 C 
45.360 -13.561 1.00 38.58 N 
43.143 -13.967 1.00 40.00 N 
47.740 -9.604 1.00 35.42 C 
48.667 -9.609 1.00 35.07 O 
47,901 .-9.822 1.00 36.96 N 
49.221 -9.962 1.00 38.44 C 
49.118 -10.083 1.00 38.53 C 
50.494 -10.251 1.00 38.65 C 
48.422 -8.863 1.00 38.97 C 
50.033 -11.125 1.00 39.26 C 
51.238 -10.989 1.00 39.41 0 
49.357 -12.239 1.00 40.32 N 
50.007 -13.459 1.00 41.50 C 
49.012 -14.625 1.00 41.60 C 

47.777 -14.225 1.00 42.83 O 

50.778 -13.310 1.00 41.87 C 
51.830 -13.933 1.00 42.28 O 
50.269 -12.488 1.00 42.04 N 
50.897 -12.330 1.00 41.97 C 
50.187 -13.223 1.00 42.37 C 
48.739 -12.810 1.00 43.55 ^ C 

48.309 -11.781 1.00 44.32 O 
47.959 -13.451 1.00 43.86 O 
50.996 -10.878 1.00 41.41 C 
51.372 -10.657 1.00 41.59 O 
50.667 -9.905 1.00 40,49 N 
50.639 -8.475 1.00 39.60 c 
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27.707 -16.379 1.00 28.72 O 

28.239 -14.480 1.00 30.05 N 

28.940 -13.899 1.00 31.67 C 

28.631 -12.383 1.00 31.32 C 

27.139 -12.159 1.00 30.44 C 

29.149 -11.546 1.00 29.93 C 

30.458 -14.119 1.00 33.63 C 

31.112 -13.783 1.00 33.80 O 

31.018 -14.676 1.00 35.26 N 
32.455 -14.979 1.00 37.17 C 
32.907 -15.483 1.00 37.37 C 
32.265 -16.699 1.00 38.96 O 
32.865 -15.962 1.00 38.05 C 
32.223 -16.996 1.00 38.07 0 
33.934 -15.603 1.00 39.50 N 
34.521 -16.409 1.00 40.49 C 
33.499 -16.685 1.00 40.48 C 
33.547 -15.681 1.00 41.42 O 
35.738 -15.656 1.00 40.82 C 
36.091 -14.582 1.00 41.27 0 
36.364 -16.210 1.00 40.95 N 
37.535 -15.602 1.00 40.62 C 
37.294 -14.283 1.00 40.18 C 
38.263 -13.600 1.00 '40.34 O 

36.019 -13.942 1-00 39.30 N 
35.628 -12.694 1.00 38.13 C 
34.135 -12.709 1.00 38.35 C 
33.656 -11.471 1.00 37.90 C 
34.258 -11.069 1.00 36.99 C 
33.801 -9.924 1,00 37.18 C 
32.727 -9.177 1.00 35.96 C 
32.116 -9.573 1.00 35.95 C 
32.585 -10.718 1.00 37.98 C 
35,944 -11.491 1.00 37.35 C 
35,409 -11.354 1.00 37.45 O 
36.814 -10.620 1.00 36.17 N 
37.263 -9.468 1.00 35.23 C 
38.647 -9.003 1.00 35.81 C 
38.612 -8.570 1.00 37.04 O 
36.280 -8.291 1.00 33.45 C 
36.544 -7.295 1.00 33.93 O 
35.157 -8.402 1.00 31.13 N 
34.190 -7.310 1.00 28.02 C 
33.377 -7.066 1.00 26.06 C 
32.736 -6.040 1.00 24.79 O 
33.396 -8.030 1.00 25.16 N 
32.726 -7.877 1.00 24.40 C 
31.540 -8.863 1.00 24.34 C 
30.797 -8.585 1.00 25.25 C 
30.567 -8.737 1.00 23.66 C 
33.759 -8.104 1.00 24.50 C 
34.566 -9.040 1.00 24.27 O 
33.744 -7.232 1.00 23.78 N 
34.576 -7.383 1.00 24.18 C 
34,185 -6.324 1.00 24.30 C 
35.394 -5.954 1.00 23.11 C 
36.449 -5.162 1.00 22.00 C 
32.934 -6.766 1.00 23.91 C 
34.483 -8.789 1.00 24.77 C 
33.403 -9.366 1.00 25.02 O 
35.619 -9.337 1.00 25.08 N 
35.622 -10.667 1.00 26.17 C 
36.833 -11.454 1.00 27.31 C 
37.173 -12.714 1.00 32.33 C 
38.311 -13.536 1.00 39.80 C 
38.040 -13.891 1.00 45.22 N 
38.838 -14.646 1.00 47.93 Q 
39.976 -15.139 1.00 48.73 N 
38.498 -14.908 1.00 48.97 N 
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35.590 -10.594 1.00 25.19 C 
36.402 -9.885 1.00 24.65 0 
34.634 -11.304 1.00 24.85 N 
34.609 -11.499 1.00 24.90 C 
33.223 -11.918 1.00 24.41 C 
33.073 -12.127 1.00 24.89 C 
33.148 -10.796 1.00 24.67 C 
31.790 -12.871 1.00 23.97 C 
35.639 -12.554 1.00 25.16 C 
35.565 -13.695 1.00 25.41 o 
36.604 -12.163 1.00 25.22 N 
37.703 -13.046 1.00 25.72 C 
38.991 -12.240 1.00 25.54 C 
39.501 -11.524 1.00 25.95 C 
40.685 -10.620 1-00 25.00 C 
39.868 -12.525 1.00 27.51 C 
37.363 -13.847 1.00 25.66 C 
37.678 -15.040 1.00 25.68 O 
36.717 -13.177 1.00 25.47 N 
36.333 -13.762 1.00 25.47 C 
37.557 -13.872 1.00 25.78 C 
37.485 -15.071 1.00 27.00 C 
36.385 -15.610 1.00 27.45 O 
38.501 -15.537 1.00 29.70 0 
35.308 -12.856 1.00 25.21 C 
35.119 -11.711 1.00 23.96 O 
34.669 -13.369 1.00 25.09 N 
33.785 -12.565 1.00 25.83 C 
32.360 -12.565 1.00 26.03 C 
31.729 -13.918 1.00 27.82 C 
31.818 -14.883 1.00 28.39 C 
31.102 -15.995 1.00 30.13 N 
30.539 -15. '770 1.00 30.67 C 
30.914 -14.465 1.00 29.83 C 
30.460 -13.987 1.00 30.76 C 
29.664 -14.809 1.00 32.48 C 
29.313 -16.107 1.00 33.06 C 
29.739 -16.602 1.00 32.39 C 
33.808 -13.045 1.00 26.10 C 
34.170 -14.191 1.00 25.61 O 
33.425 -12.155 1.00 26.36 N 
33.407 -12.449 1.00 26-90 C 
34.607 -11.796 1.00 27.07 C 
35.945 -12.285 1.00 28.66 C 
36.602 -13.330 1.00 30.61 C 
37.851 -13.785 1.00 31.43 C 
38.446 -13.195 1.00 31.46 C 
37.795 -12.153 1.00 30.52 C 
36.556 -11.701 1.00 28.63 C 
32.117 -11.924 1.00 27.02 C 

31.591 -10.879 1.00 26.50 O 
31.614 -12.655 1.00 26.99 N 
30.500 -12.201 1.00 27.50 C 
29,476 -13.323 1.00 27.99 C 
28.179 -12.828 1.00 28.36 C 
27.116 -13.894 1.00 29.75 C 
27.215 -14.989 1.00 31.47 O 
26.154 -13.622 1.00 30.33 O 
31.023 -11.696 1.00 27.98 C 
31.891 -12.310 1.00 28.26 O 
30.490 -10.560 1.00 28.31 N 
30.760 -10.004 1.00 28.18 C 
31.378 -8.614 1.00 28.37 C 
32-883 -8.522 1.00 27.57 C 
33.452 -7.293 1.00 27.27 C 
34.899 -7.282 1.00 26.93 N 
35.671 -6.233 1.00 27.51 0 
35.146 -5.079 1.00 28.76 -N 
36.979 -6.339 1.00 26.92 N 
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63.625 43,217 -3.482 1.00 24.73 O 
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63.132 44.012 3.835 1.00 25.33 C 

63.764 42.908 4.700 1.00 25.91 C 
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62.544 48.534 0.690 1.00 28.73 C 

62.308 48.600 1.904 1.00 31.92 O 

63.382 49.361 0.078 1.00 30.86 N 

59.857 46.042 -0.307 1.00 26.01 C 

58.771 46.630 -0.338 1.00 25.84 O 
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28.206 14.966 1.00 21.28 C 

28.677 15.259 1.00 21.63 O 

26.906 14.743 1.00 21.89 N 

25.904 14.741 1.00 23.14 C 

25.466 16.161 1.00 24.31 C 

25.367 16.363 1.00 28.93 C 

24.729 15.530 1.00 31.83 O 

25.919 17.333 1.00 34.08 O 

24.671 13.918 1.00 22.10 C 

24.508 13.582 1.00 22.05 O 

23.823 13.590 1.00 21.41 N 

22.616 12.803 1.00 21.20 C 

22.130 12.119 1.00 21.27 C 

21.934 13.162 1-00 22.95 C 

21.359 12.604 1.00 24.55 C 

23.113 11.020 1.00 22.14 C 

21.516 13.687 1.00 21.05 C 

21,507 14.896 1.00 20.43 O 

20.610 13.112' 1.00 21.20 N 

19.609 13.924 1.00 22.14 C 

19.070 12.967 1.00 22.34 C 

19.243 11.581 1.00 20.54 C 

20.461 11.670 1.00 21.11 C 
18.466 14.413 1.00 23.21 C 
17.885 15.443 1.00 23.48 O 
18.143 13.699 1.00 24.34 N 
16.949 13.999 1.00 25.31 C 

15.894 12.881 1.00 24.74 C 
15.596 12.440 1.00 22.55 C 
15.212 13,354 1.00 21.74 C 
14.916 12-923 1.00 21.18 C 
15.011 11.556 1.00 21.06 C 
15.392 10.642 1.00 20.29 C 
15.672 11.088 1.00 21.16 C 
17.286 14.169 1.00 27.27 C 
18.038 13.370 1-00 27-35 O 
16.692 15.186 1.00 29.13 N 
16.849 15.428 1.00 31.65 C 
17.361 16.849 1.00 32.51 C 
18.723 17.153 1.00 37.83 C 

19.895 16.505 1.00 43.98 C 
19.903 15.260 1.00 46.61 O 
20.824 17.241 1.00 46.61 O 
15.568 15.187 1.00 31.62 C 
15.630 14.759 1.00 32.81 O 
14.409 15.454 1.00 31.55 N 
13-146 15.307 1.00 31.42 C 
12.423 16.650 1.00 32.01 C 

13- 265 17.738 1.00 34.56 C 
13.790 18.763 1.00 36.67 N 

14- 502 19.561 1.00 37.86 C 

14.462 19.087 1.00 37.93 N 
13.697 17.945 1.00 36.34 C 
12.235 14.243 1.00 30.37 C 
12.327 13.941 1.00 29.56 O 
11.360 13.689 1,00 29.19 N 
10.373 12.691 1.00 28.54 C 

9.315 12.473 1.00 28.34 C 

9.853 11.726 1.00 28.71 C 

11.016 11.252 1.00 28.60 O 

9.174 11.572 1-00 30.07 O 

9.670 13.103 1.00 28.04 C 

9.485 12-284 1.00 27.65 O 

9.292 14.381 1.00 27.40 N 

8.563 14.978 1.00 27-37 C 

8.286 16.468 1.00 28.11 C 

7.369 16.736 1.00 32.49 C 

8.072 16.650 .1.00 37.16 C 

9-255 17.037 1.00 39.44 O 
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1 . 00 


25 


.30 


c 


ATOM 


1765 


O 


GLY A 251 


55 . 074 


IZ . Z4o 


XX • 4 y o 


1 nn 


25 


.76 


0 


ATOM 


1766 


N 


GLN A 252 


56. 177 


IT Q T7 
11 •Oil 


X J . ^Zo 


1 nn 

X ■ UU 


25 


.26 


N 


ATOM 


1767 


CA 


GLN A 252 


55 . 082 


1 o "ST "a 
IZ . Ji / J 


X4 . ZOo 


1 nn 


25 


.84 


c 


ATOM 


1768 


CB 


GLN A 252 


55 . OOO 


1 1 en's 


XO • O 7o 


1 no 

X • u u 


26.06 


c 


ATOM 


1769 


CG 


GLN A 252 


53 . 79d 


1 1 Q'a'7 

XX - y J f 


X O . fi DO 


1 - 00 


29 


.12 


c 


ATOM 


1770 


CD 


GLN A 252 


52 .439 


XX . b4 y 


T c oo"7 
XO . OO / 


1 nn 

X * UU 


32 


.13 


c 


ATOM 


1771 


OEl 


GLN A 252 


ol . O J / 


1 o ^^^^ 

XZ . DUO 


1 ^ fl Rd 


1 . 00 


31 


.35 


0 


ATOM 


1772 


NE2 


GLN A 252 


CO O QO 


XU . ^HiO 


JL O . ^ Drt 


1 - 00 


32 


.42 


N 


ATOM 


1773 


C 


GLN A 252 


55 , Z /X 


in Q c "a 
X J . o Do 


X fl . OU^ 


1 on 

X . u u 


25 


.11 


c 


ATOM 


1774 


0 


GLN A 252 


56 . 310 


X 4 . JUo 


X o . U U D 


1 . 00 


24 


.98 


0 


ATOM 


1775 


N 


VAL A 253 


54 . Zbo 


1 A C^'>.A 


1 A TOO 
Xfi . X/CO 


1 no 

J. • UU 


24 


.33 


N 


ATOM 


177 6 


CA 


VAL A 253 


54 , 351 


1 c n Q 
X D . UoD 


X fl . X y D 


1 on 

X ■ u u 


24 


.36 


Q 


ATOM 


1777 


CB 


VAL A 253 


53 . 751 


1 C T A 
X D • / OU 


XZ . yzz 


1 nn 

X . UU 


24 


.24 


c 


ATOM 


1778 


CGI 


VAL A 253 


53 . 9 /I 


1 Q o An 

Xo . Z 4 U 


TO Q /I R 
XZ . O 


^ nn 

X . u u 


24 


.73 


c 


ATOM 


1779 


CG2 


VAL A 253 


54 . 356 


xo . XZ4 


1 1 (\A 1 

XX ■ o^ / 


1 nn 

X . UU 


24 


.87 


c 


ATOM 


1780 


C 


VAL A 253 


53 . 601 


lb • o / o 


1 c 401 
xo • 40X 


1 nn 

X . UU 


23 


.69 




ATOM 


1781 


0 


VAL A 253 


52 . 427 


1 C 0*7 CI 

lo . z /o 


XO ■ ouz 


1 nn 

X . UU 


23 


.22 


0 


ATOM 


1782 


N 


PHE A 254 


54 . 297 


1 / . oZx 


1 /r OT Q 
Xo . Z / 0 


1 nn 

X • UU 


23 


.14 


N 


ATOM 


1783 


CA 


PHE A 254 


53 . 683 


17 - 993 


IT vi n 
X I . 4Uo 


1 nn 

X . UU 


23 


.19 


p 


ATOM 


1784 


CB 


PHE A 254 


54,295 


17.481 


18 .702 


1 . 00 


22 


.64 


C 


ATOM 


1785 


CG 


APHE A 254 


53.868 


18.247 


19.912 


0.70 


21 


.91 


C 


ATOM 


1786 


CG 


BPHE A 254 


53. 915 


16.067 


18.997 


0.30 


22 


.47 


C 


ATOM 


1787 


CDIAPHE A 254 


52.711 


17.897 


20.596 


0.70 


20 


.57 


C 


ATOM 


1788 


CDIBPHE A 254 


54.877 


15.073 


19.055 


0.30 


21 


.39 


C 


ATOM 


1789 


CEIAPHE A 254 


52.308 


18.608 


21.724 


0.70 


18 


.82 


C 


ATOM 


1790 


CEIBPHE A 254 


54.517 


13.772 


19.304 


0.30 


20 


.97 


C 


ATOM 


1791 


CZ 


APHE A 254 


53.054 


19.677 


22.163 


0.70 


20 


.33 


C 
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ATOM 


1792 


CZ BPHE 


A 


254 


ATOM 


1793 


CE2APHE 


A 


254 


ATOM 


1794 


CE2BPHE 


A 


254 


ATOM 


1795 


CD2APHE 


A 


254 


ATOM 


1796 


CD2BPHE 


A 


254 


ATOM 


1797 


C 


PHE 


A 


254 


ATOM 


1798 


o 


PHE 


A 


254 


ATOM 


1799 


N 


PHE 


A 


255 


ATOM 


1800 


OA 


PHE 


A 


255 


ATOM 


1801 


CB 


PHE 


A 


255 


ATOM 


1802 


CG 


PHE 


A 


255 


ATOM 


1803 


GDI 


PHE 


A 


255 


ATOM 


1804 


CEl 


PHE 


A 


255 


ATOM 


1805 


CZ 


PHE 


A 


255 


ATOM 


1806 


CE2 


PHE 


A 


255 


ATOM 


1807 


CD2 


PHE 


A 


255 


ATOM 


1808 


C 


PHE 


A 


255 


ATOM 


1809 


o 


PHE 


A 


255 


ATOM 


1810 


N 


ARG 


A 


256 


ATOM 


1811 


CA 


ARG 


A 


256 


ATOM 


1812 


CB 


ARG 


A 


256 


ATOM 


1813 


CG 


ARG 


A 


256 


ATOM 


1814 


CD 


ARG 


A 


256 


ATOM 


1815 


ME 


ARG 


A 


256 


ATOM 


1816 


CZ 


ARG 


A 


256 


ATOM 


1817 


NHl 


ARG 


A 


256 


ATOM 


1818 


NH2 


ARG 


A 


256 


ATOM 


1819 


C 


ARG 


A 


256 


ATOM 


1820 


o 


ARG 


A 


256 


ATOM 


1821 


N 


GLN 


A 


257 


ATOM 


1822 


CA 


GLN 


A 


257 


ATOM 


1823 


CB 


GLN 


A 


257 


ATOM 


1824 


CG 


GLN 


A 


257 


ATOM 


1825 


CD 


GLN 


A 


257 


ATOM 


1826 


OEl 


GLN 


A 


257 


ATOM 


1827 


NE2 


GLN 


A 


257 


ATOM 


1828 


C 


GLN 


A 


257 


ATOM 


1829 


0 


GLN 


A 


257 


ATOM 


1830 


N 


ARG 


A 


258 


ATOM 


1831 


CA 


ARG 


A 


258 


ATOM 


1832 


CB 


ARG 


A 


258 


ATOM 


1833 


CG 


ARG 


A 


258 


ATOM 


1834 


CD 


ARG 


A 


258 


ATOM 


1835 


NE 


ARG 


A 


258 


ATOM 


1836 


CZ 


ARG 


A 


258 


ATOM 


1837 


NHl 


ARG 


A 


258 


ATOM 


1838 


NH2 


ARG 


A 


258 


ATOM 


1839 


C 


ARG 


A 


258 


ATOM 


1840 


O 


ARG 


A 


o c o 
250 


ATOM 


1841 


N 


VAL 


A 


259 


ATOM 


1842 


CA 


VAL 


A 


259 


ATOM 


1843 


CB 


VAL 


A 


259 


ATOM 


1844 


CGI 


VAL 


A 


259 


ATOM 


1845 


CG2 


VAL 


A 


259 


ATOM 


1846 


C 


VAL 


A 


259 


ATOM 


1847 


O 


VAL 


A 


259 


ATOM 


1848 


N 


SER 


A 


260 


ATOM 


1849 


CA 


SER 


A 


260 


ATOM 


1850 


CB 


SER 


A 


260 


ATOM 


1851 


OG 


SER 


A 


260 


ATOM 


1852 


C 


SER 


A 


260 


ATOM 


1853 


O 


SER 


A 


260 


ATOM 


1854 


N 


SER 


A 


261 


ATOM 


1855 


CA 


SER 


A 


261 


ATOM 


1856 


CB 


SER 


A 


261 


ATOM 


1857 


OG 


SER 


A 


261 


ATOM 


1858 


C 


SER 


A 


261 


ATOM 


1859 


O 


SER 


A 


261 


ATOM 


1860 


N 


GLU 


A 


262 
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53 . 


180 


13 . 


445 


19. 


476 


54 . 


214 


20 . 


045 


21 . 


486 


52 . 


207 


14 . 


421 


19 . 


400 


54 . 


615 


19 . 


333 


20 . 


369 


52. 


574 


15 . 


720 


19 . 


157 


53 . 


789 


19 . 


507 


17 . 


295 


54 . 


876 


20 . 


055 


17 . 


059 


52 . 


652 


20 . 


173 


17 . 


475 


52 . 


600 


21 . 


63 6 


17 . 


448 


51- 


367 


22. 


117 


16. 


705 


51- 


421 


21 . 


819 


15 - 


250 


51. 


972 


22 . 


743 


14 - 


368 


52. 


048 


22. 


466 


13 . 


018 


51. 


585 


21 . 


254 


12 - 


540 


51. 


047 


20 . 


307 


13 . 


426 


50 . 


974 


20. 


595 


14 . 


7 62 


52 . 


668 


22. 


239 


18 . 


830 


51. 


839 


21 . 


958 


19. 


691 


53. 


701 


23 . 


057 


19 . 


015 


54 . 


026 


23 . 


713 


20 . 


274 


55. 


556 


23 . 


791 


20 . 


412 


56. 


282 


24 . 


268 


1 9 . 


116 


57 . 


825 


24 . 


203 


19. 


. 1 64 


58. 


346 


23 . 


, 171 


18 . 


. 257 


59. 


,580 


22 , 


, 660 


18 . 


,298 


60. 


,466 


23 . 


, 080 


19 . 


, 20,4 


59. 


, 930 


21. 


,716 


17 . 


,431 


53, 


,444 


25. 


,122 


20 . 


.255 


53. 


,389 


25. 


. 807 


21 . 


, 279 


53. 


.023 


25. 


.546 


19 . 


, 068 


52. 


.369 


26. 


, 834 


18 , 


. 871 


53. 


. 126 


27 . 


, 643 


17 . 


. 803 


54 . 


, 514 


28. 


.095 


18 . 


. 215 


54 . 


. 493 


29 , 


.381 


19. 


. 019 


53, 


. 596 


30. 


. 222 


18 , 


. 846 


55, 


. 480 


29 , 


. 545 


19 , 


. 901 


50, 


. 931 


26, 


. 611 


18 , 


.399 


50, 


. 618 


25, 


. 574 


17 . 


. 821 


50, 


. 072 


27 


.593 


18 , 


. 633 


48 


.726 


27 


. 571 


18 


. 079 


47 


. 862 


28 


. 674 


18 


. 683 


46 


.355 


28 


, 438 


18, 


.509 


45.834 


28 


. 809 


17 


,134 


44 


.538 


28 


.195 


16, 


.847 


43 


.844 


28 


.395 


15 


.725 


44 


,316 


29 


.200 


14 


.757 


42 


.677 


27 


.789 


15 


.570 


48 


-811 


27 


.738 


16.564 


49 


.282 


28 


.759 


16 


.074 


48 


.367 


26 


.716 


15 


.843 


48 


.404 


26.682 


14 


.389 


49 


.533 


25 


.733 


13 


.879 


49 


.472 


25 


.544 


12 


,361 


50 


.929 


26 


.259 


14 


.310 


47 


.043 


26.171 


13 


. 920 


46 


.541 


25 


.190 


14 


.460 


46 


.451 


26 


.843 


12 


-930 


45 


.141 


26 


.451 


12 


.398 


44 


-687 


27 


.398 


11 


.273 


45 


.399 


27 


.137 


10 


-073 


45 


-145 


25 


.005 


11 


.916 


46 


.182 


24 


.484 


11 


.518 


43 


.974 


24 


.367 


11 


.957 


43 


.820 


22 


. 972 


11 


.559 


42 


.398 


22 


.499 


11 


.855 


42 


.169 


22 


.473 


13 


.254 


44 


.118 


22 


.748 


10 


.082 


44 


.630 


21 


.694 


9 


.701 


43 


.780 


23 


.729 


9 


,256 
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0.30 21.52 C 

0.70 20.58 C 

0.30 21.67 C 

0.70 21.56 C 

0.30 21.61 C 

1.00 23.65 C 

1.00 23.03 O 

1.00 24.25 N 

1.00 24.57 C 

1.00 24.07 C 

1.00 22.93 C 

1.00 20.88 C 

1.00 18.91 C 

1.00 20.61 C 

1.00 19.19 C 

1.00 19.54 C 

1.00 25.43 C 

1.00 25.30 O 

1.00 26.64 N 

1.00 27.50 C 

1,00 28.35 C 

1.00 31.19 C 

1.00 35.66 C 

1.00 38.13 N 

1.00 40.12 C 

1.00 40.47 N 

1.00 38.85 N 

1.00 27.08 C 

1.00 27.90 O 

1.00 26.07 N 

1.00 25.36 C 

1.00 25.81 C 

1.00 29.91 C 

1.00 35.63 C 

1 . 00 37. 93 O 

1.00 37.58 N 

1.00 23.15 C 

1.00 22.57 O 

1.00 21.04 N 

1.00 19.58 C 

1.00 19-69 C 

1-00 21.79 C 

1.00 24.81 C 

1-00 26.49 N 

1-00 27.04 C 

1.00 27.33 N 

1.00 25.64 N 

1-00 18,78 C 

1.00 18.29 O 

1.00 17.49 N 

1.00 17.02 C 

1.00 16.57 C 

1.00 15.19 C 

1.00 17.98 C 

1.00 16.99 C 

1.00 16.43 O 

1.00 17.02 N 

1,00 17.31 C 

1-00 17-29 C 

1.00 17.50 O 

1.00 17.62 C 

1.00 17.40 O 

1.00 17.98 N 

1.00 18.43 C 

1.00 18.46 C 

1.00 19.59 O 

1.00 18.64 C 

1.00 18.31 O 

1-00 19.27 N 
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ATOM 


1861 


CA 


GLU 


A 


262 


44 . 


102 


ATOM 


1862 


CB 


GLU 


A 


262 


43. 


461 


ATOM 


1863 


CG 


GLU 


A 


262 


42. 


033 


ATOM 


1864 


CD 


GLU 


A 


262 


41. 


304 


ATOM 


1865 


OEl 


GLU 


A 


262 


41. 


928 


ATOM 


1866 


OE2 


GLU 


A 


2 62 


40. 


101 


ATOM 


1867 


C 


GLU 


A 


262 


45. 


615 


ATOM 


1868 


O 


GLU 


A 


262 


46. 


131 


ATOM 


1869 


N 


CYS 


A 


263 


46. 


318 


ATOM 


1870 


CA 


CYS 


A 


263 


47, 


785 


ATOM 


1871 


CB 


CYS 


A 


263 


48. 


359 


ATOM 


1872 


SG 


CYS 


A 


263 


50. 


133 


ATOM 


1873 


C 


CYS 


A 


263 


48. 


385 


ATOM 


1874 


O 


CYS 


A 


263 


49. 


223 


ATOM 


1875 


N 


GLN 


A 


264 


47 . 


932 


ATOM 


1876 


CA 


GLN 


A 


264 


48. 


389 


ATOM 


1877 


CB 


GLN 


A 


264 


47, 


650 


ATOM 


■ 1878 


CG 


GLN 


A 


264 


48. 


085 


ATOM 


1879 


CD 


GLN 


A 


264 


47 . 


598 


ATOM 


1880 


OEl 


GLN 


A 


264 


47. 


359 


ATOM 


1881 


NE2 


GLN 


A 


264 


47 . 


464 


ATOM 


1882 


C 


GLN 


A 


264 


48. 


191 


ATOM 


1883 


O 


GLN 


A 


264 


49. 


,068 


ATOM 


1884 


N 


HIS 


A 


265 


47. 


,033 


ATOM 


1885 


CA 


HIS 


A 


265 


46. 


712 


ATOM 


1886 


CB 


HIS 


A 


265 


45. 


.269 


ATOM 


1887 


CG 


HIS 


A 


265 


44. 


.890 


ATOM 


1888 


NDl 


HIS 


A 


2 65 


45. 


.147 


ATOM 


1889 


CEl 


HIS 


A 


2 65 


44. 


,712 


ATOM 


1890 


NE2 


HIS 


A 


265 


44. 


.190 


ATOM 


1891 


CD2 


HIS 


A 


265 


44. 


.294 


ATOM 


1892 


C 


HIS 


A 


2 65 


47. 


.701 


ATOM 


1893 


O 


HIS 


A 


265 


48, 


.219 


ATOM 


1894 


N 


LEU 


A 


266 


47, 


.973 


ATOM 


1895 


CA 


LEU 


A 


266 


48, 


.891 


ATOM 


1896 


CB 


LEU 


A 


266 


48 


.925 


ATOM 


1897 


CG 


LEU 


A 


266 


49 


.889 


ATOM 


1898 


GDI 


LEU 


A 


266 


49 


.700 


ATOM 


1899 


CD2 


LEU 


A 


266 


49 


.731 


ATOM 


1900 


C 


LEU 


A 


266 


50.282 


ATOM 


1901 


O 


LEU 


A 


266 


50 


.906 


ATOM 


1902 


N 


ILE 


A 


267 


50 


.741 


ATOM 


1903 


CA 


ILE 


A 


267 


52 


.072 


ATOM 


1904 


CB 


ILE 


A 


267 


52 


.425 


ATOM 


1905 


CGI 


ILE 


A 


2 67 


52 


.702 


ATOM 


1906 


CDl 


ILE 


A 


267 


52 


. 656 


ATOM 


1907 


CG2 


ILE 


A 


267 


53 


. 626 


ATOM 


1908 


C 


ILE 


A 


267 


52 


.173 


ATOM. 


1909 


o 


ILE 


A 


267 


53 


.119 


ATOM 


1910 


N 


ARG 


A 


268 


51 


.178 


ATOM 


1911 


CA 


ARG 


A 


268 


51 


.165 


ATOM 


1912 


CB 


ARG 


A 


268 


49 


.990 


ATOM 


1913 


QG 


ARG 


A 


268 


50 


.240 


ATOM 


1914 


CD 


ARG 


A 


268 


49 


.234 


ATOM 


1915 


NE 


ARG 


A 


268 


48 


.912 


ATOM 


1916 


CZ 


ARG 


A 


268 


49 


.629 


ATOM 


1917 


NHl 


ARG 


A 


268 


50 


.663 


ATOM 


1918 


NH2 


ARG 


A 


268 


49 


.331 


ATOM 


1919 


C 


ARG 


A 


268 


51 


.104 


ATOM 


1920 


o 


ARG 


A 


268 


51 


. 676 


ATOM 


1921 


N 


TRP 


A 


269 


50 


.397 


ATOM 


1922 


CA 


TRP 


A 


269 


50 


.336 


ATOM 


1923 


CB 


TRP 


A 


269 


49 


.340 


ATOM 


1924 


CG 


TRP 


A 


269 


48 


,810 


ATOM 


1925 


CDl 


TRP 


A 


269 


49 


.030 


ATOM 


1926 


NEl 


TRP 


A 


269 


48 


.387 


ATOM 


1927 


CE2 


TRP 


A 


269 


47 


.716 


ATOM 


1928 


CD2 


TRP 


A 


269 


47 


.957 


ATOM 


1929 


CE3 


TRP 


A 


269 


47 


.377 
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23.659 7.829 1.00 20.49 C 

24-807 7.058 1.00 21.45 C 

24.525 6.627 1.00 27.18 C 

25.782 6.184 1.00 35.25 C 

26.645 5.498 1.00 38.82 O 

25,915 6.522 1.00 39.29 O 

23.663 7.614 1.00 19.31 C 
22.851 6.853 1.00 18.98 O 

24.561 8.297 1.00 18.97 N 
24.596 8.196 1.00 18.66 C 
25,805 8.937 1.00 18.56 C 
26.031 8.731 1.00 18.69 S 
23.275 8.703 1.00 18.45 C 

22.664 8.024 1.00 18.06 O 
22.827 9.873 1.00 18.01 N 
21.553 10.434 1.00 18.32 C 
21.210 11.748 1.00 17.97 C 
22.034 12.955 1.00 18.25 C 
21.447 14.282 1.00 20.77 C 
20.240 14.382 1.00 19.45 O 
22.299 15.304 1.00 19.10 N 
20.419 9.424 1.00 18.18 C 
19.581 9.252 1.00 18.03 O 
20.405 8.768 1.00 18.36 N 
19.366 7.805 1.00 19.15 C 
19.505 7.310 1.00 19.80 C 
18.474 6.295 1.00 23.71 C 
18.627 4.948 1.00 27.52 N 

17.562 4.294 1.00 28.98 C 
16.720 5.170 1.00 29.95 N 
17,264 6.430 1.00 27.50 C 
19..383 6.624 1.00 18.47 C 
18.340 6.236 1.00 17.33 O 
20.577 6.089 1.00 17.86 N 
20.717 4.968 1.00 17.70 C 
22.167 4.440 1.00 17.79 C 

22.490 3.277 1.00 16.98 C 
21.533 2.080 1.00 16.25 C 
23.941 2.832 1.00 16.33 C 
20.225 5.372 i . 00 17.61 C 
19.443 4.642 1.00 17.30 O 
20.639 6.555 1.00 17.19 N 
20.263 7.023 1-00 16.80 C 
20.934 8.385 1.00 16.67 C 
22.433 8.196 1.00 15.36 C 
23.273 9.494 1.00 14.12 C 
20.245 9.024 1.00 15.66 C 
18.753 7.137 1.00 17.49 C 
18.156 6.618 1.00 16.97 O 
18.140 7.783 1.00 17.52 N 
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2300 


o 


HOH 


w 


29 


70. 


325 


37.710 


6. 


677 


1. 


00 


31. 


97 


o 


ATOM 


2301 


o 


HOH 


w 


30 


71. 


184 


18.590 


9. 


525 


1. 


00 


35. 


40 


0 


ATOM 


2302 


o 


HOH 


w 


31 


53. 


890 


12.402 


-3. 


734 


1. 


00 


32. 


44 


o 


ATOM 


2303 


o 


HOH 


w 


32 


52. 


246 


19.524 


-9. 


419 


1. 


00 


35. 


84 


0 


ATOM 


2304 


0 


HOH 


w 


33 


40. 


639 


26.398 


16. 


837 


1. 


00 


35. 


65 


o 


ATOM 


2305 


o 


HOH 


w 


34 


60. 


620 


13.344 


-8. 


811 


1. 


00 


44. 


92 


0 


ATOM 


2306 


o 


HOH 


w 


35 


75. 


110 


44 . 117 


2. 


424 


1. 


00 


40. 


04 


o 


ATOM 


2307 


o 


HOH 


w 


36 


74 . 


471 


37.990 


7 . 


461 


1 . 


00 


40. 


83 


0 


ATOM 


2308 


o 


HOH 


w 


37 


59. 


228 


35.799 


-5 . 


068 


1. 


00 


33. 


92 


0 


ATOM 


2309 


o 


HOH 


w 


38 


57 . 


123 


17.309 


16. 


065 


1 . 


00 


40. 


82 


o 


ATOM 


2310 


o 


HOH 


w 


39 


73. 


994 


32.523 


-2 . 


675 


1 . 


00 


33. 


30 


o 


ATOM 


2311 


o 


HOH 


w 


40 


69. 


993 


44 . 693 


3 . 


957 


1, 


00 


50. 


42 


o 


ATOM 


2312 


o 


HOH 


w 


41 


65. 


864 


18.120 


0. 


377 


1 . 


00 


37 . 


77 


0* 


ATOM 


2313 


o 


HOH 


w 


42 


48. 


834 


35.741 


2. 


440 


1 . 


00 


31. 


77 


o 


ATOM 


2314 


o 


HOH 


w 


43 


52. 


185 


7- 956 


4 . 


576 


1 . 


00 


38. 


71 


o 


ATOM 


2315 


o 


HOH 


w 


44 


64. 


7 65 


11.133 


-15. 


777 


1 . 


00 


33. 


65 


0 


ATOM 


2316 


o 


HOH 


w 


45 


48. 


197 


17.129 


-9- 


023 


1 . 


00 


35 . 


25 


o 


ATOM 


2317 


o 


HOH 


w 


46 


71. 


559 


9.704 


-7 . 


782 


1 . 


00 


42 . 


63 


0 


ATOM 


2318 


o 


HOH 


w 


47 


72 . 


838 


31-092 


-4 . 


833 


1 . 


00 


32 . 


07 


o 


ATOM 


2319 


o 


HOH 


w 


48 


54 . 


340 


33 .741 


-9. 


311 


1 . 


00 


38 . 


01 


0 


ATOM 


2320 


o 


HOH 


w 


49 


54 . 


223 


11. 905 


9. 


111 


1 . 


00 


33 . 


22 


0 


ATOM 


2321 


o 


HOH 


w 


50 


52 . 


887 


36. 641 


-1- 


776 


1 . 


00 


38 . 


05 


0 


ATOM 


2322 


o 


HOH 


w 


51 


58 . 


033 


32 . 102 


18 . 


276 


1 . 


00 


41 . 


20 


0 


ATOM 


2323 


o 


HOH 


w 


52 


58 . 


7 64 


11 .092 


17 . 


987 


1 . 


00 


35 . 


48 


o 


ATOM 


2324 


o 


HOH 


w 


53 


56. 


210 


29.247 


-10 . 


737 


1 . 


00 


40 . 


52 


o 


ATOM 


2325 


o 


HOH 


w 


54 


75 . 


583 


29 . 566 


5 . 


684 


1 . 


00 


42 . 


12 


0 


ATOM 


2326 


0 


HOH 


w 


55 


82 . 


299 


27 .704 


4 . 


152 


1 . 


00 


43 . 


41 


o 


ATOM 


2327 


o 


HOH 


w 


56 


61 . 


670 


6 . 087 


15 . 


210 


1 . 


00 


42 . 


4 6 


0 


ATOM 


2328 


o 


HOH 


w 


57 


41 . 


909 


26 . 005 


9 . 


492 


1 , 


00 


38 . 


52 


0 


ATOM 


2329 


o 


HOH 


w 


58 


72. 


941 


15. 417 


4 . 


788 


1 . 


00 


53 . 


05 


0 


ATOM 


2330 


o 


HOH 


w 


59 


56. 


478 


27 .573 


-12. 


787 


1 . 


00 


37 . 


09 


0 


ATOM 


2331 


o 


HOH 


w 


60 


83. 


158 


40.743 


6, 


932 


1 . 


00 


41 . 


95 


0 


ATOM 


2332 


o 


HOH 


w 


61 


44. 


574 


20.141 


-2. 


416 


1. 


00 


38. 


16 


0 


ATOM 


2333 


o 


HOH 


w 


62 


51. 


818 


13 . 854 


13 . 


409 


1 . 


00 


36- 


87 


0 


ATOM 


2334 


o 


HOH 


w 


63 


56, 


901 


22.491 


-11. 


879 


1. 


00 


46, 


71 


0 


ATOM 


2335 


o 


HOH 


w 


64 


46. 


066 


31.890 


-3 . 


335 


1 . 


00 


46. 


54 


o 


ATOM 


2336 


o 


HOH 


w 


65 


46. 


390 


17 . 471 


11 . 


120 


1 . 


00 


41. 


50 


0 


ATOM 


2337 


o 


HOH 


w 


66 


73. 


021 


18.047 


7 . 


67 9 


1. 


00 


45, 


56 


0 


ATOM 


2338 


o 


HOH 


w 


67 


56.272 


6-117 


-3. 


207 


1. 


00 


47 . 


16 


o 


ATOM 


2339 


o 


HOH 


w 


68 


78. 


807 


46.222 


C. 


194 


1. 


00 


43. 


86 


0 


ATOM 


2340 


o 


HOH 


w 


69 


70. 


343 


14.512 


-11. 


989 


1. 


00 


41. 


06 


o 


ATOM 


2341 


o 


HOH 


w 


70 


43. 


908 


38. 440 


0. 


67 0 


1. 


00 


53. 


02 


o 


ATOM 


2342 


o 


HOH 


w 


71 


40. 


352 


28.430 


2. 


051 


1. 


00 


45. 


97 


o 


ATOM 


2343 


o 


HOH 


w 


72 


44. 


496 


19.095 


11. 


235 


1. 


00 


54. 


47 


o 
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ATOM 


2344 


O 


HOH 


W 


73 


47. 


165 


ATOM 


2345 


O 


HOH 


W 


74 


56. 


445 


ATOM 


2346 


O 


HOH 




73. 


363 


ATOM 


2347 


O 


HOH 




67. 


665 


ATOM 


2348 


O 


HOH 


W 


77 


77. 


512 


ATOM 


2349 


O 


HOH 


W 


78 


64. 


562 


ATOM 


2350 


0 


HOH 


W 


79 


72. 


601 


ATOM 


2351 


O 


HOH 


W 


80 


64. 


569 


ATOM 


2352 


O 


HOH 


W 


81 


58. 


851 


ATOM 


2353 


O 


HOH 


W 


82 


66. 


378 


ATOM 


2354 


O 


HOH 


w 


83 


52. 


161 


ATOM 


2355 


O 


HOH 


w 


84 


84. 


302 


ATOM 


2356 


O 


HOH 


w 


85 


49. 


501 


ATOM 


2357 


0 


HOH 


w 


86 


63. 


118 


ATOM 


2358 


O 


HOH 


w 


87 


75. 


334 


ATOM 


2359 


o 


HOH 


w 


88 


51. 


946 


ATOM 


2360 


o 


HOH 


w 


89 


46. 


051 


ATOM 


2361 


o 


HOH 


w 


90 


60. 


662 


ATOM 


2362 


o 


HOH 


w 


91 


78. 


926 


ATOM 


2363 


o 


HOH 


w 


92 


83. 


687 


ATOM 


2364 


o 


HOH 


w 


93 


65. 


774 


ATOM 


2365 


0 


HOH 


w 


94 


48. 


,890 


ATOM 


2366 


o 


HOH 


w 


95 


71. 


.057 


ATOM 


2367 


o 


HOH 


w 


96 


73. 


.156 


ATOM 


2368 


o 


HOH 


w 


97 


56. 


.031 


ATOM 


2369 


o 


HOH 


w 


98 


90. 


.130 


ATOM 


2370 


o 


HOH 


w 


99 


64. 


.199 


ATOM 


2371 


o 


HOH 


w 


100 


52. 


.185 


ATOM 


2372 


o 


HOH 


w 


101 


78. 


.245 


ATOM 


2373 


o 


HOH 


w 


102 


70, 


.395 


ATOM 


2374 


o 


HOH 


w 


103 


76 


.497 


ATOM 


2375 


o 


HOH 


w 


104 


53, 


.869 


ATOM 


2376 


o 


HOH 


w 


105 


52, 


.957 


ATOM 


2377 


o 


HOH 


w 


106 


81 


.062 


ATOM 


2378 


o 


HOH 


w 


107 


8f 


.023 


ATOM 


2379 


o 


HOH 


w 


108 


55 


.351 


ATOM 


2380 


o 


HOH 


w 


109 


72 


.893 


ATOM 


2381 


0 


HOH 


w 


110 


64 


.150 


ATOM 


2382 


o 


HOH 


w 


111 


70 


.497 


ATOM 


2383 


o 


HOH 


w 


112 


47 


.743 


ATOM 


2384 


o 


HOH 


w 


113 


67 


.986 


ATOM 


2385 


o 


HOH 


w 


114 


65 


.956 


ATOM 


2386 


o 


HOH 


w 


115 


71 


.948 


ATOM 


2387 


0 


HOH 


w 


116 


73 


.384 


ATOM 


2388 


o 


HOH 


w 


117 


69 


.213 


ATOM 


2389 


o 


HOH 


w 


118 


92 


.376 


ATOM 


2390 


o 


HOH 


w 


119 


71 


.748 


ATOM 


2391 


0 


HOH 


w 


120 


72 


.751 


ATOM 


2392 


o 


HOH 


w 


121 


44 


.373 


ATOM 


2393 


o 


HOH 


w 


122 


72 


.331 


ATOM 


2394 


o 


HOH 


w 


123 


85.766 


ATOM 


2395 


o 


HOH 


w 


124 


82 


.375 


ATOM 


2396 


o 


HOH 


w 


125 


69 


.185 


ATOM 


2397 


o 


HOH 


w 


126 


72 


.843 


ATOM 


2398 


o 


HOH 


w 


127 


58 


.459 


ATOM 


2399 


o 


HOH 


w 


128 


64 


.272 


ATOM 


2400 


o 


HOH 


w 


129 


59 


.782 



15.788 6.720 1.00 41.01 O 

43.287 --5. 157 1.00 44.15 O 

25.539 -17.736 1-00 50.03 O 

14.838 -15.990 1.00 47,37 O 

31.796 8.477 1.00 43.71 O 

45.570 -0.458 1.00 42.82 O 

12.906 5.087 1.00 41.65 O 

29.017 13.834 1.00 46.57 O 

5.715 -3.038 1.00 36.10 O 

16.370 -1.785 1.00 36.18 O 

13.315 8.870 1.00 38.72 O 

27.201 -0.028 1.00 44.09 O 

13.263 -4.243 1.00 40.50 O 

41.154 -5.640 1.00 44.61 O 

10.847 -0.667 1.00 41.03 O 

9.440 7.089 1.00 44.13 O 

15.476 9.731 1.00 44.74 O 

7.651 -3.345 1.00 33.21 O 

37.602 8.589 1.00 45.81 O 

38.788 8.645 1.00 42-46 O 

37.305 -9.200 1.00 42.61 O 

32.798 13.190 1.00 44.81 O 

6.982 7.124 1.00 46.01 O 

42.259 2.367 1.00 41.64 O 

35.393 16.920 1.00 47.09 O 

23.863 -3.527 1.00 56.29 O 

16.375 1.499 1.00 32.31 O 

30.882 13.804 1.00 45,03 O 

25.957 -3.060 1.00 37.82 O 

32.498 -12.472 1.00 40.91 O 

26.635 -1.230 1,00 45.90 O 

39.933 10.992 1.00 48.40 O 

42.953 -5.317 1.00 43.64 O 

46.768 -1.405 1.00 51.11 O 

38.607 -2.261 1.00 44.62 O 

15.949 -6.982 1,00 56.30 O 

16.276 -11.510 1.00 40.24 O 

29.039 11-361 1.00 44.87 O 

11.613 14.584 1.00 42.70 O 

30.590 14.200 1.00 41.84 O 

19.239 2.487 1.00 45.28 O 

9.523 -15.205 1.00 44.06 O 

39.028 3.510 1.00 48.92 O 

36.583 9.395 1.00 49.46 O 

13.943 11.885 1.00 43.79 O 

29.991 -13.421 1.00 50.66 O 

33.616 8.364 1.00 44.60 O 

30.625 9.138 1.00 54.54 O 

14.896 1.763 1.00 54.90 O 

37.663 5,252 1.00 54.91 O 

37.929 7.966 1.00 45.42 O 

46.624 -12.952 1.00 49.98 O 

5.514 -10.117 1.00 57.15 O 

16.943 -2.415 1.00 48,35 O 

18.549.-11.193 1.00 68.47 O 

33.293 -12.839 1.00 48.86 O 

37,121 -16.253 1.00 59.29 O 
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TABLES 



DISEASES ASSOCIATED WITH GENETIC DEFECTS IN KINASES 



Serine/Threonine Protein Kinases 

CYGLIN-DEPENDENT KINASE 7; CDK7 
DYSTROPHIA MYOTONICA; DM 
MINIBRAIN (DROSOPHILA) HOMOLOG; MNBH 
RAC SERINE/THREONINE PROTEIN KINASE 
SERINE-THREONINE PROTEIN KINASE N; PKN 
SERINE/THREONINE PROTEIN KINASE 2; STK2 

PROTEIN KINASE, DNA-ACTIVATED, CATALYTIC SUBUNIT; PRKDC 
ZIPPER PROTEIN KINASE; ZPK 
PROTEIN-TYROSINE KINASE STY 

BRUTON AGAMMAGLOBULINEMIA TYROSINE KINASE; BTK 
MKN28 KINASE 

PROTEIN KINASE, X-LINKED; PRKK 
ELK-RELATED TYROSINE KENASE; ERK 

RIBOSOMAL PROTEIN S6 KINASE, 90 KD, POLYPEPTIDE 3; RPS6KA3 
GLYCOGEN STORAGE DISEASE VIH 
DEATH-ASSOCIATED PROTEIN KINASE 1 ; DAPKl 
PCTATRE PROTEEM KINASE 1; PCTKl 

PROTEIN KINASE, INTERFERON-INDUCIBLE DOUBLE-STRANDED RNA; PBKR 
ACTIVIN A RECEPTOR, TYPE H-LIKE KINASE 1; ACVRLKl 
PROTEIN KINASE, cAMP-DEPENDENT, CATALYTIC, ALPHA; PRKACA 
PROTEIN KINASE, Y-LINKED; PRK Y 

GPROTEIN-COUPLED RECEPTOR KIN ASE 2 (DROSOPHILA)-LIKE; GPRK2L 

PROTEIN KINASE C, THETA FORM; PRKCQ 

LIM DOMAIN KINASE 1; LEVQCl 

PHOSPHOGLYCERATE KINASE I; PGKl 

LIM DOMAIN KINASE 2; LIMK2 

C-JUN KINASE 

ACTIVIN A RECEPTOR, TYPE H-LIKE KINASE 2; ACVRLK2 

JANUS KINASE 1; JAKl 

ELKL MOTIF KINASE; EMKl 

MALE GERM CELL-ASSOCIATED KINASE; MAK 

CASEIN KINASE 2, ALPHA-PRIME SUBUNIT; CSNK2A2 

CASEIN KINASE 2, BETA POLYPEPTIDE; CSNK2B 

CASEIN KINASE 2, ALPHA 1 POLYPEPTIDE; CSNK2A1 

LEUKEMIA, CHRONIC MYELOID; CML 

RET PROTO-ONCOGENE; RET 

HEMATOPOIETIC PROGENITOR KINASE I 

CONSERVED HELIX-LOOP-HELIX UBIQUITOUS KINASE; CHUK 
CASEIN KINASE 1, DELTA; CSNKID 
CASEIN KINASE 1, EPSILON; CSNKIE 
DEATH-ASSOCIATED PROTEIN; DAP 

V-AKT MURINE THYMOMA VIRAL ONCOGENE HOMOLOG 1; AKTl 
TUMOR PROTEIN p53; TP53 

PROTEIN PHOSPHATASE 1, REGULATORY (E^IBITOR) SUBUNIT 2; PPP1R2 

SEVERE COMBINED IMMUNODEFICIENCY DISEASE-1; SCIDl 

HYPERLIPOPROTEINEML\, TYPE I 

COLLAGEN, TYPE I, ALPHA-1 CHAIN; COLl Al 

UBIQUITIN-BINDING PROTEIN P62 

TAY-SACHS DISEASE; TSD 

CYSTIC FIBROSIS; CF 

pl60-ROCK 
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BCL2-ASSOCIATED ATHANOGENE 1 ; BAG 1 

HYPOXANTHINE GUA3>nNEPHOSPHORIBOSYLTRANSFERASE 1; HPRTl 
ONCOGENE PIM-1; PIMl; PIM 

CARTILAGE-DERIVED MORPHOGENETIC PROTEIN 1 

GALACTOSEMIA 

GRANULOMATOUS DISEASE, CHRONIC, AUTOSOMAL CYTOCHROME-b-POSITIVE FORM I 
TRANSFORMING GROWTH FACTOR-BETA RECEPTOR, TYPE II; TGFBR2 
RHODOPSIN; RHO 

MOTHERS AGAINST DECAPENTAPLEGIC (DROSOPHILA) HOMOLOG 2; MADH2 
TRIPLE FUNCTIONAL DOMAIN; TRIO 

TRANSFORMING GROWTH FACTOR-BETA RECEPTOR, TYPE I; TGFBRl 

GENERAL TRANSCRIPTION FACTOR IIH, POLYPEPTIDE 1 ; GTF2H1 

ANTI-MULLERIAN HORMONE TYPE H RECEPTOR; AMHR2 

V-RAF MURINE SARCOMA VIRAL ONCOGENE HOMOLOG Bl ; BRAF 

BONE MORPHOGENETIC RECEPTOR TYPE H; BMPR2 

GLYCOGEN STORAGE DISEASE V 

COMPLEX IV, CYTOCHROME c OXIDASE SUBUNIT HI 

V-RAF MURINE SARCOMA 361 1 VIRAL ONCOGENE HOMOLOG 1 ; ARAFl 

GAUCHER DISEASE, TYPE I; GD I 

V-RAF MURINE SARCOMA 361 1 VIRAL ONCOGENE HOMOLOG 2; ARAF2 

CYCLIN A; CCNA 

ANEMIA, HYPOCHROMIC; ANHl 

GRANULOMATOUS DISEASE, CHRONIC; CGD 

ACTIVIN A RECEPTOR, TYPE I; ACVRl 



Serine Protein Kinases 

CYCLIN-DEPENDENT KINASE 7; CDK7 
DYSTROPHIA MYOTONICA; DM 
MINIBRAIN (DROSOPHILA) HOMOLOG; MNBH 
RAC SERINE/THREONINE PROTEIN KINASE 
SERINE-THREONINE PROTEIN KINASE N; PKN 
PROTEIN KINASE, SERINE/ARGININE-SPECIFIC 
PROTEIN SERINE KINASE HI; PSKHl 
SERINE/THREONINE PROTEIN KINASE 2; STK2 

PROTEIN KINASE, DNA-ACTIVATED, CATALYTIC SUBUNIT; PRKDC 
ZIPPER PROTEIN KINASE; ZPK 
PROTEIN-TYROSINE KINASE STY 
SYK-RELATED TYROSINE KINASE; SRK 

BRUTON AGAMMAGLOBULINEMIA TYROSINE KINASE; BTK 
MKN28 KINASE 

NON-METASTATIC CELLS 1, PROTEIN EXPRESSED IN; NMEl 

ELK-RELATED TYROSINE KINASE; ERK 

PROTEIN KINASE, X-LINKED; PRKX 

FIBROBLAST GROWTH FACTOR RECEPTOR-2; FGFR2 

RDBOSOMAL PROTEIN S6 KINASE, 90 KD, POLYPEPTIDE 3; RPS6KA3 

PCTAIRE PROTEIN KINASE 1; PCTKl 

DEATH-ASSOCIATED PROTEIN KINASE 1; DAPKl 

ACTIVIN A RECEPTOR, TYPE H-LIKE KINASE 1 ; ACVRLKl 

GLYCOGEN STORAGE DISEASE VUI 

PROTEIN KINASE, INTERFERON-INDUCIBLE DOUBLE-STRANDED RNA; PRKR 
PROTEIN KINASE, Y-LINKED; PRKY 

G PROTEIN-COUPLED RECEPTOR KINASE 2 (DROSOPHILA)-LIKE; GPRK2L 
LIM DOMAIN KINASE 1; LIMKl 

PROTEIN KINASE, cAMP-DEPENDENT, CATALYTIC, ALPHA; PRKACA 
PROTEIN KINASE C, THETA FORM; PRKCQ 
C-JUN KINASE 

PHOSPHOGLYCERATE KINASE 1 ; PGKl 
LIM DOMAEM KINASE 2; LIMK2 
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ACTIVIN A RECEPTOR, TYPE II-LIKE KINASE 2; ACVRLK2 
JANUS KINASE 1; JAKl 
INSULIN RECEPTOR; INSR 

CASEIN KINASE 2, ALPHA 1 POLYPEPTIDE; CSNK2A1 

CASEIN KINASE 2, BETA POLYPEPTIDE; CSNK2B 

CASEIN KINASE 2, ALPHA-PRIME SUBUNIT; CSNK2A2 

ATAXIA-TELANGIECTASIA; AT 

MALE GERM CELL-ASSOCIATED KINASE; MAK 

ELKL MOTIF KINASE; EMKl 

LEUKEMIA, CHRONIC MYELOID; CML 

HEMATOPOIETIC PROGENITOR KINASE 1 

RET PROTO-ONCOGENE; RET 

CASEIN KINASE 1, DELTA; CSNKID 

CASEEM KINASE 1, EPSILON; CSNKIE 

CONSERVED HELIX-LOOP-HELIX UBIQUITOUS KINASE; CHUK 
HYPERCHOLESTEROLEMIA, FAMILIAL; FHC 
TUMOR PROTEIN p53; TP53 
TAY-SACHS DISEASE; TSD 
DEATH-ASSOCIATED PROTEIN; DAP 

V-AKT MURINE THYMOMA VIRAL ONCOGENE HOMOLOG 1; AKTl 
COLLAGEN, TYPE I, ALPHA-1 CHAIN; COLIA! 
SEVERE COMBINED IMMUNODEFICIENCY DISEASE-1 ; SCIDl 
ADENOMATOUS POLYPOSIS OF THE COLON; APC 

PROTEIN PHOSPHATASE 1, REGULATORY (INHIBITOR) SUBUNIT 2; PPP1R2 
UBIQUITIN-BINDING PROTEIN P62 

MUSCULAR DYSTROPHY, TARDIVE, DREIFUSS-EMERY TYPE, WITH CONTRACTURES 
CAVEOLIN, CAVEOLAE PROTEIN, 22~ICD; CAV 
FIBROBLAST GROWTH FACTOR RECEPTOR-3; FGFR3 
CYSTIC FIBROSIS; CF 

HYPOXANTHINE GUANINE PHOSPHORIBOSYLTRANSFERASE 1; HPRTl 
GROWTH FACTOR RECEPTOR-BOUND PROTEIN-10; GRBIO 
HYPERLIPOPROTEINEML^, TYPE I 
CREB-BINDING PROTEIN; CREBBP 
THYROID AUTOANTIGEN 70 KD; G22P1 
ONCOGENE PIM-1 ; PIMl; PIM 

SIGNAL TRANSDUCER AND ACTIVATOR OF TRANSCRIPTION 3; STAT3 

CYCLIC-AMP-RESPONSE-ELEMENT-BINDING PROTEIN- 1; CREBl 

BCL2-ASSOCIATED ATHANOGENE 1; BAGl 

P32 SPLICING FACTOR SF2 ASSOCIATED PROTEIN 

CARTILAGE-DERIVED MORPHOGENETIC PROTEIN 1 

pl60-ROCK 

GROWTH FACTOR RECEPTOR-BOUND PROTEE^-14; GRB14 
THROMBASTHENIA OF GLANZMANN AND NAEGELI 
INSULIN RECEPTOR SUBSTRATE 1 ; IRSl 

GRANULOMATOUS DISEASE, CHRONIC, AUTOSOMAL CYTOCHROME-b-POSITIVE FORM I 
GUANYLATE CYCLASE 2D, MEMBRANE; GUC2D 
GALACTOSEMIA 
RHODOPSIN; RHO 

TRANSFORMING GROWTH FACTOR-BETA RECEPTOR, TYPE II; TGFBR2 

ICHTHYOSIS, X-LINKED 

TRIPLE FUNCTIONAL DOMAIN; TRIO 

MOTHERS AGAINST DECAPENTAPLEGIC (DROSOPHILA) HOMOLOG 2; MADH2 
MCF.2 CELL LINE DERIVED TRANSFORMING SEQUENCE; MCF2 
ESTROGEN RECEPTOR; ESR 

V-RAF MURINE SARCOMA 361 1 VIRAL ONCOGENE HOMOLOG 1 ; ARAFl 
V-RAF MURINE SARCOMA 361 1 VIRAL ONCOGENE HOMOLOG 2; ARAF2 
BONE MORPHOGENETIC RECEPTOR TYPE II; BMPR2 
GAUCHER DISEASE, TYPE I; GD I 

TRANSFORMING GROWTH FACTOR-BETA RECEPTOR, TYPE I; TGFBRl 
GENERAL TRANSCRIPTION FACTOR IIH, POLYPEPTIDE 1 ; GTF2H1 
CHYMOTRYPSIN-LIKE. PROTEASE; CTRL 
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GLYCOGEN STORAGE DISEASE V 
CYCLINA; CCNA 

LETHAL GIANT LARVAE (DROSOPfflLA) HOMOLOG 1; LLGLl 
GRANULOMATOUS DISEASE, CHRONIC; CGD 
CASEIN, BETA; CSN2 

DENTIN MATRIX ACIDIC PHOSPHOPROTEIN 1; DMPl 
TREACHER COLLINS-FRANCESCHETTI SYNDROME 1; TCOFl 
COMPLEX IV, CYTOCHROME c OXIDASE SUBUNIT HI 
DIABETES MELLITUS, AUTOSOMAL DOMINANT 
V-RAF MURINE SARCOMA VIRAL ONCOGENE HOMOLOG Bl ; BRAF 
ANEMIA, HYPOCHROMIC; ANHl 

ANTI-MULLERL\N HORMONE TYPE II RECEPTOR; AMHR2 
PLASMINOGEN ACTIVATOR INHIBITOR, TYPE 2; PAI2 
ACTIVIN A RECEPTOR, TYPE I; ACVRl 



Threonine Protein Kinases 

CYCLIN-DEPENDENT KINASE 7; CDK7 
DYSTROPIHA MYOTONICA; DM 
MINIBRAIN (DROSOPHILA) HOMOLOG; MNBH 
RAC SERINE/THREONINE PROTEIN KINASE 
SERINE-THREONINE PROTEIN KINASE N; PKN 
SERINE/THREONINE PROTEIN KINASE 2; STK2 

PROTEIN KINASE, DNA-ACTIVATED, CATALYTIC SUBUNIT; PRKDC 
ZIPPER PROTEIN KINASE; ZPK 
PROTEIN-TYROSINE KINASE STY 

PROTEIN KINASE, MITOGEN-ACTIVATED, KINASE 1; PRKMKl 
BRUTON AGAMMAGLOBULINEMIA TYROSINE KINASE; BTK 
MKN28 KINASE 

PROTEIN KINASE, X-LINKED; PRKX 

ELK-RELATED TYROSINE KINASE; ERK 

GLYCOGEN STORAGE DISEASE VIII 

PYRUVATE KINASE DEFICIENCY OF ERYTHROCYTE 

RIBOSOMAL PROTEIN S6 KINASE, 90 KD, POLYPEPTIDE 3; RPS6KA3 

DEATH-ASSOCIATED PROTEIN KINASE 1; DAPKl 

PCTAIRE PROTEE^J KINASE 1 ; PCTKl 

PROTEIN KINASE, INTERFERON-INDUCIBLE DOUBLE-STRANDED RNA; PRKR 
ACTIVIN A RECEPTOR, TYPE II-LIKE KINASE 1; ACVRLKl 
PROTEIN KINASE, Y-LINKED; PRKY 

G PROTEIN-COUPLED RECEPTOR KINASE 2 (DROSOPlllLA)-LIKE; GPRK2L 
PROTEIN KINASE, cAMP-DEPENDENT, CATALYTIC, ALPHA; PRKACA 
PROTEIN KINASE C, THETA FORM; PRKCQ 

MUSCULAR DYSTROPHY, PSEUDOHYPERTROPHIC PROGRESSIVE, DUCHENNE AND BECKER 
TYPES; DMD 

LIM DOMAIN KINASE 1; LIMKl 
PHOSPHOGLYCERATE KINASE 1; PGKl 
LIM DOMAIN KINASE 2; LIMK2 
C-JUN KINASE 

ACTIVIN A RECEPTOR, TYPE II-LDCE KINASE 2; ACVRLK2 
JANUS KINASE 1 ; JAKl 

CASEIN KINASE 2, ALPHA 1 POLYPEPTIDE; CSNK2A1 
CASEIN KINASE 2, BETA POLYPEPTIDE; CSNK2B 
MALE GERM CELL-ASSOCIATED KINASE; MAK 
MEVALONICACIDURIA 
ELKL MOTIF KINASE; EMKl 

CASEIN KINASE 2, ALPHA-PREME SUBUNIT; CSNK2A2 

MAP KINASE KINASE 6 

CASEIN KINASE 1, EPSILON; CSNKIE 

CASEIN KINASE L DELTA: CSNKID 
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LEUKEMIA, CHRONIC MYELOID; CML 

CONSERVED HELIX-LOOP-HELIX UBIQUITOUS KINASE; CHUK 
HEMATOPOIETIC PROGENITOR KINASE 1 
RET PROTO-ONCOGENE; RET 

PROTEIN PHOSPHATASE 1, REGULATORY (INHIBITOR) SUBUNIT 2; PPP1R2 
DUAL SPECIFICITY PHOSPHATASE 1; DUSPl 

V-AKT MURINE THYMOMA VIRAL ONCOGENE HOMOLOG 1 ; AKTl 
DEATH-ASSOCIATED PROTEIN; DAP 
TUMOR PROTEIN p53; TP53 
GLYCOGEN STORAGE DISEASE II 

SEVERE COMBINED IMMUNODEFICIENCY DISEASE-1 ; SCIDl 
UBIQUITIN-BINDING PROTEIN P62 
HYPERLIPOPROTEINEMIA, TYPE I 
BCL2-ASSOCIATED ATHANOGENE 1; BAGl 
ONCOGENE PIM-1; PIMl; PIM 

HYPOXANTHINE GUANESJE PHOSPHORIBOSYLTRANSFERASE 1; HPRTl 
pl60-ROCK 

CARTILAGE-DERT/ED MORPHOGENETIC PROTEIN 1 

CYSTIC FIBROSIS; CF 

TAY-SACHS DISEASE; TSD 

COLLAGEN, TYPE I, ALPHA-1 CHAIN; COLl Al 

GRANULOMATOUS DISEASE, CHRONIC, AUTOSOMAL CYTOCHROME-b-POSITIVE FORM I 
NEUROMATA, MUCOSAL, WITH ENDOCRINE TUMORS 
TRANSFORMING GROWTH FACTOR-BETA RECEPTOR, TYPE H; TGFBR2 
GALACTOSEMIA 

MOTHERS AGAINST DECAPENTAPLEGIC (DROSOPHILA) HOMOLOG 2; MADH2 
TRIPLE FUNCTIONAL DOMAIN; TRIO 

ANTI-MULLERIAN HORMONE TYPE II RECEPTOR; AMHR2 

BONE MORPHOGENETIC RECEPTOR TYPE II; BMPR2 

V-RAF MURINE SARCOMA VIRAL ONCOGENE HOMOLOG Bl; BRAF 

CYCLIN A; CCNA 

COMPLEX IV, CYTOCHROME c OXIDASE SUBUNIT HI 
RHODOPSIN; RHO 

V-RAF MURINE SARCOMA 361 1 VIRAL ONCOGENE HOMOLOG 1; ARAFl 
GRANULOMATOUS DISEASE, CHRONIC; CGD 

TRANSFORMING GROWTH FACTOR-BETA RECEPTOR, TYPE I; TGFBRl 
ANEMLN., HYPOCHROMIC; ANHl 

GENERAL TRANSCRIPTION FACTOR IIH, POLYPEPTIDE 1 ; GTF2H1 

GLYCOGEN STORAGE DISEASE V 

GAUCHER DISEASE, TYPE I; GD I 

ADENINE PHOSPHORIBOSYLTRANSFERASE; APRT 

V-RAF MURINE SARCOMA 361 1 VIRAL ONCOGENE HOMOLOG 2; ARAF2 
ACTIVIN A RECPPTOR, TYPE I; ACVRl 



Tyrosine Protein Elinases 

NEUROTROPHIC TYROSINE KINASE, RECEPTOR, TYPE 1; NTRKl 

PROTEIN-TYROSINE KINASE, CYTOPLASMIC; PTK2 

SYK-RELATED TYROSINE KINASE; SRK 

PROTEIN TYROSINE KINASE CTK; CTK 

TYR03 PROTEIN TYROSINE KINASE; TYR03 

BRUTON AGAMMAGLOBULINEMIA TYROSINE KCNASE; BTK 

LEUKOCYTE TYROSINE KINASE; LTK 

PROTEIN-TYROSESTE KINASE SYK; SYK 

PROTEIN-TYROSINE KINASE STY 

TEK TYROSINE KINASE, ENDOTHELUL; TEK 

ELK-RELATED TYROSINE KINASE; ERK 

TYROSINE KINASE WITH IMMUNOGLOBULIN AND EGF FACTOR HOMOLOGY DOMAINS; TIE 
PROTEIN TYROSINE KINASE TKF 
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NEUROTROPHIC TYROSINE KINASE, RECEPTOR, TYPE 3; NTRK3 

MIXED-LINEAGE PROTEIN KINASE-3; MLK3 

PROTEIN KINASE, MITOGEN-ACTIVATED 4; PRKM4 

PROTEIN KINASE, MITOGEN-ACTIVATED 1; PRKMl 

PROTEIN TYROSINE KINASE PTK7; PTK7 

PROTEIN TYROSINE KINASE EEK 

MINIBRAIN (DROSOPHILA) HOMOLOG; MNBH 

BONE MARROW KINASE, X-LINKED; BMX 

EPH-LIKE TYROSINE KINASE 1; ETKl 

MACROPHAGE STIMULATING 1 RECEPTOR; MSTIR 

BTK-ASSOCIATED PROTEIN, 135 KD 

LYMPHOCYTE-SPECIFIC PROTEIN TYROSINE KINASE; LCK 
FIBROBLAST GROWTH FACTOR RECEPTOR-2; FGFR2 
PROTEIN TYROSINE KINASE-3; TYK3; FER 
PROTEESr TYROSINE KINASE TXK; TXK 
TEC PROTEESr TYROSINE KINASE; TEC 
PROTEIN TYROSINE KINASE-2; TYK2 

EPH-RELATED RECEPTOR TYROSEME KINASE LIGAND 1; EPLGl 
T-CELL TYROSINE KINASE EMT; EMT 
EPH TYROSINE KINASE 1; EPHTl 

ZONA PELLUCIDA RECEPTOR TYROSINE KINASE, 95 KD; ZRK 

PROTEIN KINASE, MITOGEN-ACTIVATED, KINASE 1; PRKMKl 

EPH TYROSINE KINASE 3; EPHT3 

GROWTH ARREST-SPECIFIC GENE-6; GAS6 

KINASE INSERT DOMAIN RECEPTOR; KDR 

AXL RECEPTOR TYROSINE KINASE; AXL 

FIBROBLAST GROWTH FACTOR RECEPTOR-1; FGFRl 

V-ERB-B2 AVIAN ERYTHROBLASTIC LEUKEMIA VIRAL ONCOGENE HOMOLOG 2; ERBB2 
FMS-LIKE TYROSINE KINASE-3; FLT3 
NEUROEPITHELIAL TYROSINE KINASE; NEP 

NEUROTROPHIC TYROSINE KINASE RECEPTOR-RELATED 3; NTRKR3 
EPH-RELATED RECEPTOR TYROSINE KINASE LIGAND 5; EPLG5 
NEUROTROPHIC TYROSINE KINASE, RECEPTOR, TYPE 2; NTRK2 
RYK RECEPTOR-LIKE TYROSINE KINASE 
TYROSINE KINASE, B-LYMPHOCYTE SPECIFIC; BLK 
EPH TYROSINE KINASE 2; EPHT2 

EPH-RELATED RECEPTOR TYROSINE KINASE LIGAND 2; EPLG2 
GLYCOGEN STORAGE DISEASE VIII 

EPH-RELATED RECEPTOR TYROSINE KINASE LIGAND 7; EPLG7 

JANUS KESTASE 1; JAKl 

FMS-RELATED TYROSINE KINASE-1; FLTl 

PROTEIN KINASE, cAMP-DEPENDENT, REGULATORY, TYPE I, ALPHA; PRKARl A 

WEE-1 TYROSINE KINASE; WEEl 

EPH-LIKE TYROSINE KINASE 2; ETK2 

RECEPTOR TYROSEME KESTASE MuSK 

INSULIN RECEPTOR; INSR 

JANUS KESTASE 3 JAK3 

FMS-RELATED TYROSINE KINASE-3 LIGAND 
PROTEIN KINASE C, BETA 1; PRKCBl 

TYROSINE KINASE-TYPE CELL SURFACE RECEPTOR HER3; HER3 

JANUS KESTASE 2; JAK2 

LIM DOMAIN KINASE 1 ; LIMKl 

DUAL SPECIFICITY PHOSPHATASE 1; DUSPl 

MUSCULAR DYSTROPHY, PSEUDOHYPERTROPHIC PROGRESSIVE, DUCHENNE AND BECKER 
TYPES; DMD 

HEMOPOIETIC CELL KINASE; HCK 

TYROSEME 3-MONOOXYGENASE/TRYPTOPHAN 5-MONOOXYGENASE ACTIVATION PROTEIN 
ETA POLYPEPTIDE; YWHAH 
RET PROTO-ONCOGENE; RET 

TYROSINE 3-MONOOXYGENASE/TRYPTOPHAN 5-MONOOXYGENASE ACTIVATION PROTEIN 
ZETA POLYPEPTIDE; YWHAZ 
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TYROSINE 3-MONOOXYGENASE/TRYPTOPHAN 5-MONOOXYGENASE ACTIVATION PROTEIN 

BETA POLYPEPTIDE; YWHAB 

HEPATOMA TRANSMEMBRANE KINASE; HTK 

MAP KINASE KINASE 6 

PHOSPHATID YLINOSITOL 3-KlNASE, CATALYTIC, ALPHA POLYPEPTIDE; PIK3CA 
CYCLIN-DEPENDENT KINASE INHIBITOR 3; CDKN3 
DL\CYLGLYCEROL KINASE, DELTA, 130 KD 

PROTEIN-TYROSINE PHOSPHATASE, NONRECEPTOR TYPE, 13; PTPN13 
ABELSON MURINE LEUKEMIA VIRAL ONCOGENE HOMOLOG 1; ABLl 
DIACYLGLYCEROL KINASE, ALPHA; DAGKl 
FOCAL ADHESION KINASE 2 

EPITHELL\L DISCOIDIN DOMAIN RECEPTOR 1 ; EDDRl 
ANAPLASTIC LYMPHOMA KINASE; ALK 

PHOSPHATID YLINOSITOL 3-KINASE, CATALYTIC, GAMMA POLYPEPTIDE; PIK3CG 
PHOSPHATID YLINOSITOL 3-KINASE REGULATORY SUBUNIT; PIK3R1 
EPH HOMOLOGY KINASE-1; EHKl 

V-KIT HARDY-ZUCKERMAN 4 FELINE SARCOMA VIRAL ONCOGENE HOMOLOG; KIT 

FIBROBLAST GROWTH FACTOR RECEPTOR-3; FGFR3 

VASCULAR ENDOTHELIAL GROWTH FACTOR C; VEGFC 

MACROPHAGE STIMULATING 1; MSTl 

HYPERCHOLESTEROLEML\, FAMILIAL; FHC 

EPIDERMAL GROWTH FACTOR RECEPTOR; EGFR 

SlOO CALCIUM-BINDING PROTEIN AlO; S1OOA10 

NEUROFIBROMATOSIS, TYPE I; NFl 

ONCOGENE TRK 

LEUKEMIA, CHRONIC MYELOID; CML 

GROWTH FACTOR RECEPTOR-BOUND PROTEIN-7; GRB7 

SlOO CALCIUM-BINDING PROTEIN A4; Si00A4 

RAS p21 PROTEIN ACTIVATOR; RASA2 

ADENOMATOUS POLYPOSIS OF THE COLON; APC 

METPROTO-ONCOGENE; MET 

SIGNAL TRANSDUCER AND ACTIVATOR OF TRANSCRIPTION 3; STAT3 
smg GDS-ASSOCIATED PROTEIN 
UBIQUITIN-BINDING PROTEIN P62 
LYMPHOCYTE CYTOSOLIC PROTEIN 2; LCP2 

EPIDERMAL GROWTH FACTOR RECEPTOR PATHWAY SUBSTRATE-15; EPS 15 
GROWTH FACTOR RECEPTOR-BOUND PROTEIN-10; GRBIO 

GLIAL CELL LINE-DERIVED NEUROTROPHIC FACTOR RECEPTOR-ALPHA; GDNFRA 
SHC TRANSFORMING PROTEIN; SHCl 
CYSTIC FIBROSIS; CF 
TROPOMYOSIN 3; TPM3 

CELL DIVISION CYCLE 2, Gl TO S AND G2 TO M; CDC2 
MUSCULAR DYSTROPHY, LIMB GIRDLE, TYPE 2C; LGMD2C 
ASH PROTEIN 
TAY-SACHS DISEASE; TSD 
AGRIN; AGRN 

SlOO CALCIUM-BINDING PROTEIN A6; S100A6 
AGRANULOCYTOSIS, INFANTILE GENETIC 
TRIPLE FUNCTIONAL DOMAIN; TRIO 

HYPOXANTHINE GUANINE PHOSPHORIBOSYLTRANSFERASE 1; HPRTl 
CYTOVILLIN 

GOLGI APPARATUS PROTEIN 1; GLGl 

GROWTH FACTOR RECEPTOR-BOUND PROTEIN-14; GRB14 

V-FES FELINE SARCOMA VIRAITV-FPS FUJINAMI AVIAN SARCOMA VIRAL ONCOGENE 
HOMOLOG; FES 

TRANSLOCATED PROMOTER REGION 

P32 SPLICING FACTOR SF2 ASSOCIATED PROTEIN 

CARTILAGE-DERIVED MORPHOGENETIC PROTEIN 1 

PAIRED BOX HOMEOTIC GENE 5; PAX5 

INSULIN RECEPTOR SUBSTRATE 1; IRSl 

SON OF SEVENLESS (DROSOPHILA^ HOMOLOG 2; SOS2 
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PHOSPHATIDYLINOSITOL GLYCAN CLASS A; PIGA 
ICHTHYOSIS, X-LINKED 
RHODOPSIN; RHO 

SEVERE COMBINED IMMUNGDEFICrENCY DISEASE, AUTOSOMAL RECESSIVE T-NEGATIVE/B- 
POSITIVE TYPE ' iNcvjA A X V c/o 

CAS-BR-M (MURINE) ECOTROPIC RETROVIRAL TRANSFORMING SEQUENCE- CBL 
TRANSFORMING GROWTH FACTOR-BETA RECEPTOR, TYPE H; TGFBR2 
COLONY-STIMULATING FACTOR-1 RECEPTOR; CSFIR 
PHOSPHODIESTERASE I/NUCLEOTIDE PYROPHOSPHATASE 1- PDNPl 
NUCLEOPHOSMIN; NPMl 
ADDUCIN 1; ADDl 

NEUROMATA, MUCOSAL, WITH ENDOCRINE TUMORS 
HYALURONAN-MEDIATED MOTILITY RECEPTOR; HMMR 
ESTROGEN RECEPTOR; ESR 
SRC-LIKE ADAPTER; SLA 
BECKWITH-WIEDEMANN SYNDROME; BWS 
TYLOSIS WITH ESOPHAGEAL CANCER; TOC 
PLACENTAL GROWTH FACTOR; PGF 
ETS VARIANT GENE 6; ETV6 

MEMBRANE COMPONENT, CHROMOSOME 6, POLYPEPTIDE 2; M6P2 
INTERLEUKIN-4; IL4 

GARDNER-RASHEED FELINE SARCOMA VIRAL (V-FGR) ONCOGENE; PGR 

FIBROBLAST GROWTH FACTOR-8; FGF8 

SORTING NEXIN- 1 ; SNX 1 

TRANSCRIPTION FACTOR 1, HEPATIC; TCFl 

HEPATOCYTE GROWTH FACTOR; HGF 

INTERLEUKIN-6 RECEPTOR; IL6R 

V-YES-1 YAMAGUCHI SARCOMA VIRAL ONCOGENE; YESl 
ENDOGLIN; ENG 

THANATOPHORIC DYSPLASIA WITH KLEEBLATTSCHAEDEL 

HEMATOPOIETIC CELL-SPECIFIC LYN SUBSTRATE 1; HCLSl 

GENERAL TRANSCRIPTION FACTOR IIH, POLYPEPTIDE 1 ; GTF2H1 

V-SIS PLATELET-DERIVED GROWTH FACTOR BETA POLYPEPTIDE; PDGFB 

DIABETES MELLITUS, AUTOSOMAL DOMINANT 

PROGRAMMED CELL DEATH 1; PDCDl 

TRANSFORMING GROWTH FACTOR-BETA RECEPTOR, TYPE I; TGFBRl 
EPIDERMAL GROWTH FACTOR RECEPTOR PATHWAY SUBSTRATE-8; EPS8 
VASCULAR ENDOTHELIAL GROWTH FACTOR; VEGF 
CELL ADHESION REGULATOR; CAR 

ANEMIA, CONGENITAL HYPOPLASTIC, OF BLACKFAN AND DIAMOND 
GAUCHER DISEASE, TYPE I; GD I 
MAPLE SYRUP URINE DISEASE; MSUD 

MCF»2 CELL LINE DERIVED TRANSFORMING SEQUENCE; MCF2 
GRANULOMATOUS DISEASE, CHRONIC; CGD 
ANGIOPOIETIN 2; ANGPT2 

HYPOGAMMAGLOBULINEMIA AND ISOLATED GROWTH HORMONE DEFICIENCY, X-LINKED 
GLIAL CELL LINE-DERIVED NEUROTROPHIC FACTOR RECEPTOR-BETA 
H4 GENE 
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CLAIMS 

What is claimed is: 

L A kinase scaffold library comprising at least one set of compounds, each set 
consisting essentially of a plurality of compounds of a chemical structure selected from the 
group consisting of Formula I, n, HI, IV, V, VI, and Vn. 

2. The library of claim 1, wherein said set comprises at least 50 different 
compounds. 

3. The library of claim 1 , wherein said library comprises at least 3 said sets. 

4. The library of claim 1, wherein a majority of compoxmds in said set have been 
demonstrated to bind to one or more kinases. 

5. The library of claim 4, wherein said kinases comprise a plurality of kinases 
selected from the group consisting of PIM-1, Pyk2, c-Abl, Her2, cMet, VEGFR, EGFR, 
cKit, Pkcp, p38, Cdk2, Akt, Gsk3p. 

6. The library of claim 1, wherein each said compound is in a separate well in a 
plate or plxirality of plates. 

7. A system for fitting compounds in binding sites of one or more protein kinases, 
comprising 

an electronic kinase scaffold library comprising at least one collection of electronic 
representations of compounds of a chemical structure selected from the group consisting of 
Foraiula 1, 11, III, IV, V, VI, and VII, wherein said kinase scaffold library is embedded in a 
computer memory device, wherein said electronic representations of said compounds can be 
selectively retrieved and ftinctionally connected with computer software adapted to fit 
electronic representations of compounds in an electronic representation of a binding site of a 
kinase. 
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8. The system of claim 7, further comprising at least one electronic representation 
of a kinase binding site embedded in computer memory such that said electronic 
representation of a kinase binding site can be functionally connected with said computer 
software. 

9. The system of claim 8, where said electronic representation of kinase binding 
sites comprises electronic representations of binding sites of a plurality of kinases selected 
from the group consisting of PIM~1, Pyk2, c-Abl, Her2, cMet, VEGFR, EGFR, cKit, PkcP, 
p38, Cdk2, Akt, GskSp. 



10. A method for obtaining improved ligands binding to a protein kinase, 
comprising 

detemiining whether a derivative of a compound of Formula I, II, III, IV, V, VI, or 
Vn binds to said kinase with greater affinity or greater specificity or both than said 
compound, wherein binding with greater affinity or greater specificity or both indicates that 
said derivative is an improved ligand. 

11. The method of claim 10, wherein said derivative has at least 10-fold greater 
affinity or specificity or both than said compound. 

12. The method of claim 10, wherein said derivative has at least 100-fold greater 
affinity or specificity or both. 

13. The method of claim 10, wherein said kinase is selected from ttie group 
consisting of PIM-1, Pyk2, c-Abl, Her2, cMet, VEGFR, EGFR, cKit, Pkcp, p38, Cdk2, Akt, 
Gsk3p- 

14. A method for developing ligands specific for a particular kinase, comprising 
determining whether a derivative of a compound of Formula I, II, III, IV, V, VI, or 

VII that binds to a plurality of kinases has greater specificity for said particular kinase than 
said compound. 
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15. The method of claim 14, wherein said compound binds to said kinase with an 
affinity at least 10-fold greater than for binding to any of said plxirality of kinases. 

16. The method of claim 15, wherein said compound interacts with at least one of 
PIM-1 residues 49, 52, 65, 67, 121, 128, and 186. 

17. The method of claim 14, wherein said compound binds weakly to said plurality 
of kinases. 

18. The method of claim 14, wherein said kinase is selected from the group 
consisting of PIM-1, Pyk2, c-Abl, Her2, cMet, VEGFR, EGFR, cKit, Pkcp, p38, Cdk2, Akt, 
GskSp. 

19. A method for developing ligands binding to a kinase, comprising 
determining the orientation of at least one molecular scaffold of Formula I, II, III, 

IV, V, VI, or Vn in co-crystals with said kinase; and 

identifying chemical structures of said molecular scaffolds, that, when modified, 
alter the binding affinity or binding specificity or both between the molecular scaffold and 
said kinase; and 

synthesizing a ligand wherein one or more of the chemical structures of the 
molecular scaffold is modified to provide a ligand that binds to said kinase with altered 
binding affinity or binding specificity or both. 

20. The method of claim 19, wherein said molecular scaffold is a weak binding 
compound. 

21. The method of claim 19, wherein said molecular scaffold binds to a plurality of 
kinases. 

22. The method of claim 19, wherein said molecular scaffold interacts with at least 
3 kinases selected from the group consisting of PIM-1, Pyk2, c-Abl, Her2, cMet, VEGFR, 
EGFR, cKit, Pkcp, p38, Cdk2, Akt, and Gsk3p. 

. 23. The method of claim 19, wherein said kinase is selected from the group 
consisting of PIM-1, Pyk2, c-Abl, Her2, cMet, cKit, Pkcp, Cdk2, and Akt* 
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24. A method for developing ligands with increased specificity on a kinase, 
comprising 

testing a derivative of a kinase binding compoimd of Formula 1, 11, III, IV, V, VI, or 
Vn for increased specificity on said kinase, wherein increased specificity is indicative that 
said derivative is a ligand with increased specificity. 

25. The method of claim 24, wherein said kinase binding compound binds to at 
least 5 different human kinases. 

26. The method of claim 24, wherein said kinase binding compound binds to at 
least 10 different himian kinases. 

27. The method of claim 24, wherein said kinase is selected from the group 
consisting of PIM-1, Pyk2, c-Abl, Her2, cMet, VEGFR, EGFR, cKit, Pkcp, p38, Cdk2, Akt, 
GskSp.. 

28. A method for identifying a hgand binding to a kinase, comprising 
determining whether a derivative compound that includes a core structure selected 

from the group consisting of Formula I, H, III, IV, V, VI, and VII binds to said kinase with 
altered binding affinity or specificity or both as compared to the parent compound. 

29. A co-crystal of a kinase and a binding compound of Formula I, H, in, IV, V, VI, 

or vn. 

30. The co-crystal of claim 29, wherein said binding compoimd interacts with at 
least one of PIM-1 residues 49, 52, 65, 67, 121, 128, and 186. 

31. The co-crystal of claim 29, wherein said kinase is selected from the group 
consisting of PIM-1, Pyk2, c-Abl, Her2, cMet, VEGFR, EGFR, cKit, Pkcp, p38, Cdk2, Akt, 
Gsk3p. 

32. The co-crystal of claim 29, wherein said co-crystal is in an X-ray beam. 
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33. A method for obtaining co-crystals of PIM-1 with a binding compound of 
Formula I, II, III, IV, V, VI, or VII, comprising subjecting PIM-1 protein at 5-20 mg/ml to 
crystallization conditions substantially equivalent to Hampton Screen 1 conditions 2, 7, 14, 
17, 23, 25, 29, 36, 44, or 49 in the presence of binding compoimd for a time sufficient for 
crystal development. 

34. The method of claim 33, wherein said binding compound is added to said 
protein to a final concentration of 0.5 to 1.0 mM. 

35. The method of claim 34, wherein said binding compound is in a dimethyl 
sulfoxide solution. 

36. The method of claim 33, wherein said crystallization condition is 0.4-0.9 M 
sodium acetate trihydrate pH 6.5, 0.1 M imidazole; or 0.2-0.7 M. sodium potassium tartrate, 
00.1 M MES buffer pH 6.5. 

37. A method for modulating kinase activity, comprising 

contacting said kinase with a compound of Formula I, II, HI, IV, V, VI, or VII. 

38. The method of claim 37, wherem said kinase is selected from the group 
consisting of PIM-1, Pyk2, c-Abl, Her2, cMet, VEGFR, EGFR, cBCit, Pkcp, p38, Cdk2, Akt, 
Gsk3p. 

39. The method of claim 37, wherein said compound is at a concentration of 200 
jaM or less. 

40. A method for treating a patient suffering from a disease or condition 
characterized by abnormal kinase activity, comprising 

administering to said patient a compound of Formula 1, 11, III, IV, V, VI, or VII 
active on said kinase. 
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41. The method of claim 40, wherein said kinase is selected from the group 
consisting of PIM^l, Pyk2, c-Abl, Her2, cMet, VEGFR, EGFR, cKit, Pkcp, p38, Cdk2, Akt, 
GskSp. 

42. The method of claim 40 wherein said disease or condition is a cancer. 

43. The method of claim 40, whereni said disease or condition is an inflammatory 
disease or condition. 



44. An electronic representation of a binding site of a kinase with a compound of a 
chemical structure selected from the group consisting of Formula I, II, III, IV, V, VI, and 
Vn bound therein. 

45. The electronic representation of claim 44, comprising a binding site surface 
contour. 

46. The electronic representation of claim 44, comprising representations of the 
binding character of a plurality of conserved amino acid residues. 

47. A method for identifying potential kinase binding compounds, comprising 
fitting at least one electronic representations of a compound of Formula I, II, III, IV, 

V, VI, or VII in an electronic representation of a kinase binding site. 

48. The method of claim 47, wherein said kinase is selected from the group 
consisting of PIM-1, Pyk2, c-Abl, Her2, cMet, VEGFR, EGFR, cKit, PkcP, p38, Cdk2, Akt, 
Gsk3p. 

49. The method of claim 48, comprising 

removing a computer representation of a compound complexed vsath said kinase and 
fitting a computer representation of a compound from a computer database with a computer 
representation of the active site of said kinase; and 

identifying compounds that best fit said active site based on favorable geometric fit 
and energetically favorable complementary interactions as potential binding compounds. 
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50. The method of claim 48, comprising 

modifying a computer representation of a compound complexed with said kinase by 
the deletion or addition or both of one or more chemical groups; 

fitting a computer representation of a compound from a computer database with a 
computer representation of the active site of said kinase; and 

identifying compounds that best fit said active site based on favorable geometric fit 
and energetically favorable complementary interactions as potential binding compoimds. 

51. The method of claim 48, comprising 

removing a computer representation of a compound complexed with said kinase; and 
searching a database for compounds having structural similarity to said compound 
using a compound searching computer program or replacing portions of said compound 
with similar chemical structures using a compound construction computer program. 

52. A method for attaching a kinase binding compound to an attachment 
component, comprising 

identifying energetically allowed sites for attachment of a said attachment 
component on a kinase binding corapoimd of Formula 1, 11, III, IV, V, VI, or VII; and 

attaching said compound or derivative thereof to said attachment component at said . 
energetically allowed site. 

53 • The method of claim 52, wherein said attachment component is a linker for 
attachment to a solid phase medium, and said method fiirther comprises attaching said 
compound or derivative to a solid phase medium through a linker attached at a said 
energetically allowed site. 

54. The method of claim 52, wherein said kinase is selected fi-om the group 
consisting of PIM-1, Pyk2, c-Abl, Her2, cMet, VEGFR, EGFR, cKit, Pkcp, p38, Cdk2, Akt, 
GskSp. 

55. The method of claim 52, wherein said kinase comprises conserved residues 
matching at least one of PIM-1 residues 49, 52, 65, 67, 121, 128, and 186. 

56. The method of claim 53, wherein said liiiker is a traceless linker. 
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57. The method of claim 53, wherein said kinase binding compound or derivative 
thereof is synthesized on a said linker attached to said solid phase medium. 

58. The method of claim 57, wherein a plurality of said compounds or derivatives 
are synthesized in combinatorial synthesis. 

59. The method of claim 53, wherein attachment of said compound to said solid 
phase mediimi provides an affinity medium. 

60. The method of claim 52, wherein said attachment component comprises a label. 

61 . The method of claim 60, wherein said label comprises a fluorophore. 

62. A modified compound, comprising 

a compound of Formula I, HI, IV, V, VI, or Vn, with a linker moiety attached 

thereto. 

63. The compound of claim 62, wherein said linker is attached to an energetically 
allowed site for binding of said modified compound to a kinase selected fi-om the group 
consisting of PIM-1, Pyk2, c-Abl, Her2, cMet, VEGFR, EGFR, cKit, Pkcp, p38, Cdk2, Akt, 
GskSp. 

64. The compound of claim 62, wherein said linker is attached to a solid phase. 

65. The compound of claim 62, wherein said linker comprises or is attached to a 

label. 

66. The compound of claim 62, wherein said linker is a traceless linker. 



67. A method for developing a ligand for a kinase comprising conserved residues 
matching one or more of PIM-1 residues 49, 52, 65, 67, 121, 128, and 186, comprising 

determining whether a compound of Formula I, H, III, IV, V, VI, or VII binds to 
said kinase. 
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68. The method of claim 67, wherein said kinase is selected from the group 
consisting of PIM-1, Pyk2, c-Abl, Her2, cMet, VEGFR, EGFR, cKit, Pkcp, p38, Cdk2, Akt, 
GskSp. 

69. The method of claim 67, wherein said kinase comprises conserved residues 
matching at least 2 of PIM-1 residues 49, 52, 65, 67, 121, 128, and 186. 

70. The method of claim 67, wherein said kinase comprises conserved residues 
matching PIM-1 residues 49, 52, 65, 67, 121, 128, and 186. 

71 . The method of claim 67, further comprising determining whether said 
compound modulates said kinase. 

72- The method of claim 67, wherein said determining comprises computer fitting 
said compound in a binding site of said kinase. 

73. The method of claim 67, further comprising forming a co-crystal of said kinase 
and said compoimd. 

74. The method of claim 73, further comprising determining the binding orientation 
of said compound with said kinase. 
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